HEPATITIS C VIRUS EPITOPES

1. Field of Invention 

This invention relates to specific peptide viral antigens
which are immunoreactive with sera from patients infected
with parenterally transmitted non-A, non-B hepatitis virus
(PT-NANBH, now called Hepatitis C Virus), to polynucleotide
sequences which encode the peptides, to an expression
system capable of producing the peptides, and to methods of
using the peptides for detecting PT-NANBH infection in
human sera.

2. References

Atkins, et al., Cell 62:413-423 (1990).

Ausubel, F. M., et al., Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., Media PA.

Bradley, D.W., et al., J. Infec. Dis., 148:2 (1983).
Bradley, D.W., et al., J. Gen. Virol., 69:1 (1988).
Bradley, D.W., et al., Proc. Nat. Acad. Sci. USA,
84:6277 (1987).

science, Chapter 10.

Dienstag, J.L., et al., Sem. Liver Disease, 6:67 (1986).

Cold Spring Harbor Laboratories, Cold Spring Harbor, NY
(1972).

Mullis, K., U.S. Patent No. 4,683,202, issued July
28, 1987.

Mullis, K., et al., U.S. Patent No. 4,683,195, issued
July 28, 1987.

Reyes, G., et al., Science, 247:1335 (1990).
Sanger, F., et al., Proc. Natl. Acad. Sci. USA 74:5463
Smith, D.B., et al., Gene, 67:31 (1988).
Tijssen, P., Practice and Theory of Immunoassays,
Scientific Publishing Co., NY, NY, 1985, p. 335.

Wilson, W., et al., Cell 55:1159-1169 (1988).
Woo, S. L. C., Methods in Enzymology 68:389 (1979).
Young, R. A. and R. W. Davis, Proc. Natl. Acad. Sci.
USA 80:1194-1198 (1983).

3. Background

Viral hepatitis resulting from a virus other than
hepatitis A virus (HAV) and hepatitis B virus (HBV) has
been referred to as non-A, non-B hepatitis (NANBH). More
recently, it has become clear that NANBH encompasses at
least two, and perhaps more, quite distinct viruses. One
of these, known as enterically transmitted NANBH or
ET-NANBH, is contracted predominantly in poor-sanitation
areas where food and drinking water have been contaminated
by fecal matter. The molecular cloning of a portion of this
virus, referred to as the hepatitis E virus (HEV), has
recently been described (Reyes et al.).
The second NANB virus type, known as parenterally
transmitted NANBH, or PT-NANBH, is transmitted by
parenteral routes, typically by exposure to blood or blood
products. Approximately 10% of transfusions cause PT-NANBH
infection, and about half of these go on to a chronic
disease state (Dienstag).

Human sera documented as having produced post-transfusion
NANBH in human recipients have been used successfully
to produce PT-NANBH infection in chimpanzees (Bradley).
RNA isolated from infected chimpanzee sera has been used to
construct cDNA libraries in an expression vector for
immunoscreening with chronic-state human PT-NANBH serum.
This procedure identified a PT-NANBH specific cDNA clone
and the viral sequence was then used as a probe to identify
fragments making up 7,300 contiguous basepairs of a
PT-NANBH viral agent (EPO patent application 88310922.5,
filed 11/18/88). The same procedure was used by the present
inventors to derive two of the PT-NANBH peptide and
polynucleotide sequences disclosed herein. The sequenced
viral agent has been named hepatitis C virus (HCV) (above
EPO patent application).

Heretofore, one immunogenic peptide encoded by the HCV
viral agent has been reported (Choo, Kuo, EPO application
88310922.5). This peptide, designated C-100, has been used
in immunoassays of PT-NANBH sera and found to react
immunospecifically with up to 80% of chronic NANBH samples,
and about 15% of acute NANBH samples (Kuo).

It is desirable to provide one or a collection of peptide
antigens which are immunoreactive with a greater percentage
of PT-NANBH-infected blood, including both acute and
chronic PT-NANBH infection.

4. Summary of the Invention

It is one general object of the invention to provide
recombinant polypeptides immunoreactive with sera from
humans infected with hepatitis C virus (HCV), including a
peptide which is immunoreactive with a high percentage of
sera from chronic HCV-infected individuals, and peptides
which are immunoreactive with sera associated with acute
HCV infection.

It is another object of the invention to provide an
HCV polynucleotide sequence encoding a sequence for
recombinant production of the peptide antigens, and a
diagnostic method for detecting HCV-infected human sera
using the peptide antigens.

The invention includes, in one aspect, a peptide antigen
which is immunoreactive with sera from humans infected
with HCV. One peptide antigen in the invention includes an
immunoreactive portion of an HCV polypeptide which:

a) is encoded by an HCV coding sequence;

b) has 504 amino acid residues; and

c) has the carboxy-terminal sequence presented as SEQ
ID NO:4.

Other peptide antigens of the invention include an
immunoreactive portion of any one of the following
sequences: SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:10, SEQ ID
NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID
NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID NO:26.

In another aspect, the invention includes diagnostic
kits for use in screening human blood containing antibodies
specific against HCV infection. The kit includes at least
one peptide antigen which is immunoreactive with sera from
humans infected with hepatitis C virus (HCV); specific
peptide antigens for use in the kit are given above.

One preferred embodiment of the present invention is
a diagnostic kit containing the 409-1-1(c-a) (SEQ ID NO:8)
and one of the HCV-capsid derived proteins (SEQ ID NOs:12,
14, 16, 18, 20, 22, 24, and 26); two particular embodiments
being 409-1-1(c-a) with the C1NC450 capsid-derived
peptide, and 409-1-1(c-a) with the C1NC360 capsid-derived
peptide.

In one embodiment of the present invention, the
antigen is immobilized on a solid support. The binding of
HCV-specific antibodies to the immobilized antigen is
detected by a reporter-labeled anti-human antibody which
acts to label the solid support with a detectable reporter.

The kit is used in a method for detecting HCV infection
in an individual by: (i) reacting serum from an
HCV-infected test individual with the above peptide antigen
and (ii) examining the antigen for the presence of bound
antibody.

The peptide antigens are produced, in accordance with
another aspect of the invention, using an expression system
for expressing a recombinant peptide antigen which is
immunoreactive with sera from humans infected with
hepatitis C virus (HCV). A selected expression vector
containing an open reading frame (ORF) of a polynucleotide
which encodes the peptide is introduced into a suitable
host, which is cultured under conditions which promote
expression of the ORF in the expression vector.

In one embodiment, the polynucleotide is inserted into
an expression site in a lambda gt11 phage vector, and the
vector is introduced into an E. coli host. The following
E. coli hosts have been deposited which contain vectors
including the coding sequences of the antigens shown in
parentheses: ATCC No. 40901 (SEQ ID NO:3), ATCC No. 40893
(SEQ ID NO:1), ATCC No. 40792 (SEQ ID NO:7), and ATCC
No. 40876 (SEQ ID NO:9). pGEX and pET are two other
vectors which have been used to express HCV antigens. It
will be appreciated that determination of other appropriate
vector and host combinations for the expression of the
above sequences is within the ability of one of ordinary
skill in the art.

Also forming part of the invention are polynucleotides
which encode polypeptides immunoreactive with sera from
humans infected with hepatitis C virus (HCV). One
polynucleotide of the present invention encodes a
polypeptide wherein the polypeptide includes an
immunoreactive portion of a peptide sequence which:

a) is encoded by an HCV coding sequence;

b) has 504 amino acid residues; and

c) has the carboxy-terminal sequence presented as SEQ
ID NO:4; and, where the carboxy-terminal amino acid
sequence of said peptide antigen is encoded by the
polynucleotide sequence presented as SEQ ID NO:3.

Other polynucleotides of the invention include any one
of the following sequences: SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID
NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID
NO:21, SEQ ID NO:23, and SEQ ID NO:25.

These and other objects and features of the invention
will become more fully apparent when the following detailed
description is read in conjunction with the accompanying
drawings.

5. Brief Description of the Drawings 

Figure 1 illustrates the steps in producing overlapping
linking fragments of a nucleic acid segment, in
accordance with the methods of the present invention;

Figure 2 shows the positions of overlap primer regions
and linking regions along a 7,300 basepair portion of the
HCV genome;

Figure 3 shows the DNA coding sequence of the clone 40
insert. The underlined sequences correspond to an primer
region.

Figure 4 shows the DNA coding sequence of a clone 36
insert. The underlined sequences correspond, respectively,
to the F7, F8, and R9 primer regions.

Figure 5 shows the DNA and protein coding sequences
for a 409-1-1(abc) clone insert. The "A" region of this
sequence is delineated by boxes, the "B" region by a box
and a triangle, and the "C" region by a triangle and an
asterisk.

Figure 6 shows the DNA and protein coding sequences 
for a 409-1-1 (c-a) clone insert. 

Figure 7 illustrates the groups of clones which have 
been obtained from the HCV genome in the region correspond- 
ing to the 409-1-1 (abc) clone insert. 

Figure 8A shows the DNA and protein coding sequences
for the pGEX-GG1 insert. The three G's above the first
line indicate where substitutions were made to generate the
clone pGEX-CapA. Figure 8B shows the DNA and protein
sequences for the pGEX-CapA insert coding sequence. The
primers used in polymerase chain reactions to generate
carboxy and amino terminal deletions are indicated below
the nucleotide line. The sequences of the primers are
indicated in the sense (coding strand). The actual
sequence of the NC (non-coding) primers is the reverse
complement of the indicated sequence. Coding primers are
underlined; reverse (noncoding) primers are
double-underlined. Sequences shown in capital letters are
exact matches. Sequences in lowercase letters are
"mismatched" sequences used to introduce the terminal
restriction sites (NcoI at the 5' ends and BamHI at the 3'
ends). The three nucleotides which have been altered to
remove the "slippery codons" at positions 24, 27, and 30
are indicated by bold type with the wild type A residues
shown above the sequence.

Figure 9 shows a hydropathicity plot of the HCV-core
protein encoded by pGEX-CapA. The relative locations of the
primers used to generate carboxy and amino terminal
deletions are indicated relative to the protein coding
sequence by arrows.

Figure 10 shows an epitope map of the HCV capsid 
protein region. 

6. Detailed Description of the Invention



I. Definitions 

The terms defined below have the following
meaning herein:

1. "Parenterally transmitted non-A, non-B hepatitis
viral agent (PT-NANBH)" means a virus, virus type, or virus
class which (i) causes parenterally transmitted infectious
hepatitis, (ii) is transmissible in chimpanzees, and (iii)
is serologically distinct from hepatitis A virus (HAV),
hepatitis B virus (HBV), and hepatitis E virus (HEV).

2. "Hepatitis C virus (HCV)" means a PT-NANBH viral agent
whose polynucleotide sequence includes the sequence of the
7,300 basepair region of HCV given in the Appendix, and
variations of the sequence, such as degenerate codons, or
variations which may be present in different isolates or
strains of HCV.

3. Two nucleic acid fragments are "homologous" if
they are capable of hybridizing to one another under
hybridization conditions described in Maniatis et al., op.
cit., pp. 320-323, using the following wash conditions: 2
x SSC, 0.1% SDS, room temperature twice, 30 minutes each;
then 2 x SSC, 0.1% SDS, 50°C once, 30 minutes; then 2 x
SSC, room temperature twice, 10 minutes each. Homologous
sequences can be identified that contain at most about
25-30% basepair mismatches. More preferably, homologous
nucleic acid strands contain 15-25% basepair mismatches,
even more preferably 5-15% basepair mismatches. These
degrees of homology can be selected by using more stringent
wash or hybridization conditions for identification of
clones from gene libraries (or other sources of genetic
material), as is well known in the art.
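
By way of illustration only, the mismatch ranges stated in
definition 3 can be expressed as a short calculation. The
sketch below assumes two sequences that have already been
aligned to equal length; the function names and the tier
labels are hypothetical, and the percentage is only a proxy,
since homology as defined above is determined experimentally
by hybridization and washing.

```python
def percent_mismatch(a: str, b: str) -> float:
    """Percent of mismatched positions in two pre-aligned, equal-length sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(1 for x, y in zip(a.upper(), b.upper()) if x != y)
    return 100.0 * mismatches / len(a)


def homology_tier(pct: float) -> str:
    """Map a mismatch percentage onto the ranges named in definition 3.

    The tier labels are illustrative; the cut-offs (15, 25, 30) are the
    upper bounds of the ranges given in the text.
    """
    if pct <= 15.0:
        return "even more preferred"
    if pct <= 25.0:
        return "more preferred"
    if pct <= 30.0:
        return "homologous"
    return "outside the stated range"


if __name__ == "__main__":
    pct = percent_mismatch("ACGTACGTAC", "ACGTACGTAA")
    print(pct, homology_tier(pct))
```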

4. A DNA fragment is "derived from" HCV if it has
substantially the same basepair sequence as a region of the
HCV viral genome which was defined in (2) above.

5. A protein is "derived from" a PT-NANBH or HCV
viral agent if it is encoded by an open reading frame of a
cDNA or RNA fragment derived from a PT-NANBH or HCV viral
agent, respectively.

II. Molecular Clone Selection by Immunoscreening

As one approach toward identifying a molecular clone
of a PT-NANBH agent, cDNA libraries are prepared from
infected sera in the expression vector lambda gt11. cDNA
sequences are then selected for expression of peptides
which are immunoreactive with PT-NANBH-infected sera.
Recombinant proteins identified by this approach provide
candidates for peptides which can serve as substrates in
diagnostic tests. Further, the nucleic acid coding
sequences identified by this approach serve as useful
hybridization probes for the identification of further
PT-NANBH coding sequences.

In order to make immunoscreening a useful approach for 
identifying clones originating from PT-NANBH coding sequen- 
ces, a well-defined source of PT-NANBH virus is important. 
To generate such a source, a chimpanzee (#771; Example 1A) 
was infected with transmissible PT-NANBH agents using a 
Factor VIII concentrate as a source (Bradley) . The Factor 
VIII concentrate was known to contain at least two forms of 






WO 91/15516 




PCT/US91/02370 




parenterally transmitted NANB hepatitis (PT-NANBH). In
addition to a chloroform-sensitive agent, which has
subsequently been called hepatitis C virus (HCV), a
chloroform-resistant form of PT-NANBH was also transmitted
in the concentrate (Bradley, 1983).

In the method illustrated in Example 1, infected serum
was pelleted, without dilution, by centrifugation, and cDNA
libraries were generated from the resulting pelleted virus
(Example 1B and 1C). Sera from infected human sources were
treated in the same fashion. cDNA libraries were generated,
e.g., by a random primer method using the RNA extracted
from pelleted sera as starting material (Example 1B and
1C). The resulting cDNA molecules were then cloned into a
suitable vector, for example, lambda gt11, for expression
and screening of peptide antigens, and lambda gt10, for
hybridization screening (Example 1C(iv)). Lambda gt11 is
a particularly useful expression vector which contains a
unique EcoRI insertion site 53 base pairs upstream of the
translation termination codon of the beta-galactosidase
gene. Thus, an inserted sequence is expressed as a
beta-galactosidase fusion protein which contains the
N-terminal portion of the beta-galactosidase gene product,
the heterologous peptide, and optionally the C-terminal
region of the beta-galactosidase peptide (the C-terminal
portion being expressed when the heterologous peptide
coding sequence does not contain a translation termination
codon). This vector also produces a temperature-sensitive
repressor (cI857) which causes viral lysogeny at permissive
temperatures, e.g., 32°C, and leads to viral lysis at
elevated temperatures, e.g., 42°C. Advantages of this
vector include: (1) highly efficient recombinant clone
generation, (2) ability to select lysogenized host cells on
the basis of host-cell growth at permissive, but not
non-permissive, temperatures, and (3) high levels of
recombinant fusion protein production. Further, since phage
containing a heterologous insert produces an inactive
beta-galactosidase enzyme, phage with inserts are typically
identified using a beta-galactosidase colored-substrate
reaction.
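
The condition stated above for expression of the C-terminal
beta-galactosidase region (no translation termination codon
within the heterologous coding sequence) can be checked with
a short scan. The following is a minimal sketch under stated
assumptions: the insert is given as a DNA string read in a
known frame, the function names are hypothetical, and the
sketch does not model whether the insert length preserves the
downstream reading frame.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard translation termination codons


def first_inframe_stop(insert: str, frame: int = 0):
    """Return the 0-based index of the first in-frame stop codon, or None."""
    seq = insert.upper()
    for i in range(frame, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return i
    return None


def c_terminal_region_expressed(insert: str, frame: int = 0) -> bool:
    """True when the insert has no in-frame stop codon, so translation
    can read through into the downstream beta-galactosidase sequence."""
    return first_inframe_stop(insert, frame) is None


if __name__ == "__main__":
    # Hypothetical inserts, for illustration only.
    print(first_inframe_stop("ATGGCCTAAGGG"))
    print(c_terminal_region_expressed("ATGGCCGGG"))
```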

In the screening procedure reported in Examples 1-3,
individual cDNA libraries were prepared from the serum of
one PT-NANBH infected chimpanzee (#771) and four PT-NANBH
infected humans (designated EGM, BV, WEH, and AG). These
five libraries were immunoscreened using PT-NANBH positive
human or chimpanzee sera (Example 2): 111 lambda gt11
clones were identified which were immunoreactive with at
least one of the sera. Of these 111 clones, 93 were
examined for insert hybridization with normal DNA. The
inserts were radioactively labelled and used as probes
against HindIII/EcoRI doubly-digested human peripheral
blood lymphocyte (PBL) DNA (Example 3). Approximately 46%
(43/93) of the inserts hybridized with normal human PBL DNA
and were therefore not pursued. Inserts from 11
PT-NANBH-immunopositive clones derived from chimpanzee #771
sera were characterized as exogenous to normal human PBL
DNA (Example 3). Of these 11 clones, 2 PT-NANBH clones were
identified having the following characteristics. One clone
(clone 40) was clearly exogenous by repeated hybridization
tests against normal human PBL DNA, had a relatively small
insert size (approximately 0.5 kilobases), and was quite
unreactive with negative control serum. The second clone
(clone 36) was shown to be reactive with multiple PT-NANBH
antisera, had a relatively large insert size (approximately
1.5 kilobases), and was exogenous by hybridization testing
against normal human PBL DNA. The immunoreactive
characteristics of clones 36 and 40 are summarized in Table
1 (Example 3). Clone 36 was immunoreactive with chimpanzee
#771 sera and two HCV-positive human sera, AG and BV. The
clone 36 antigen did not immunoreact with the negative
control serum SKF. Clone 40 was immunoreactive with
chimpanzee #771 sera and was cleanly nonreactive when the
negative control serum was used for screening.

The DNA sequence of clone 36 was determined in part
and is shown in Figure 4. This sequence corresponds to
nucleotides 5010 to 6516 of the HCV sequence given in the
Appendix. The DNA sequence was also determined for the
clone 40 insert (Figure 3). This sequence is homologous to
the HCV sequence (Appendix) in the region of approximately
nucleotides 6515 to 7070. The inserts of two other
chimpanzee #771 clones, clones 44 and 45, were found to be
homologous to clone 40 by hybridization and sequence
analysis (Example 4). The sequences for clones 36 and 40
are contiguous sequences, with the clone 36 sequences being
located 5' of the clone 40 sequences as presented in the
Appendix. Accordingly, these two clones represent isolation
of a significant block of the HCV genome by the
above-described immunoscreening methods.
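
The contiguity of the clone 36 and clone 40 inserts follows
from the nucleotide positions quoted above. As a minimal
sketch, assuming 1-based inclusive coordinates on the
Appendix sequence and taking the approximate clone 40
boundaries at face value, the combined span can be computed
as follows (the function name is hypothetical):

```python
def merge_if_contiguous(a, b):
    """Merge two (start, end) inclusive intervals if they overlap or abut.

    Returns the merged interval, or None if a gap separates them.
    """
    (a_start, a_end), (b_start, b_end) = sorted([a, b])
    if b_start > a_end + 1:
        return None  # a gap of at least one nucleotide
    return (a_start, max(a_end, b_end))


if __name__ == "__main__":
    clone_36 = (5010, 6516)  # positions stated in the text
    clone_40 = (6515, 7070)  # approximate positions stated in the text
    span = merge_if_contiguous(clone_36, clone_40)
    if span is not None:
        print(span, span[1] - span[0] + 1)
```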

The four lambda gt11 clones 36, 40, 44, and 45 were
deposited in the Genelabs Culture Collection, Genelabs
Incorporated, 505 Penobscot Drive, Redwood City, CA 94063.
Further, the lambda gt11 clones of clones 36 and 40 were
deposited with the American Type Culture Collection, 12301
Parklawn Dr., Rockville MD, 20852, and given the deposit
numbers ATCC No. 40901 and ATCC No. 40893.


III. PT-NANBH Sequence Identification by Hybridization
Methods

The polynucleotides identified in Section II can be
employed as probes in hybridization methods to identify
further HCV sequences, and these can then be used as probes
to identify additional sequences. The polynucleotides can
be directly cloned or fragmented by partial digestion to
generate random fragments. The resulting clones can be
immunoscreened as described above to identify HCV antigen
coding sequences.

To illustrate how the inserts of clones 36 and 40 can
be used to identify clones carrying HCV sequences, the
insert of clone 40 was isolated and used as a hybridization
probe against the individual cDNA libraries established in
lambda gt10 (see above). Using the clone 40 probe,
approximately 24 independent hybridization-positive clones
were plaque purified (Example 5). The positive signals
arose with different frequencies in cDNA libraries from the
different serum sources, suggesting that the hybridization
signals were from the serum sources, rather than resulting
from some common contaminant introduced during the cDNA
synthesis or cloning (Table 2). One of the clones,
108-2-5, which tested positive by hybridization with the
clone 40 insert, had an insert of approximately 3.7 kb
(Example 6). Since it had such a large insert, clone
108-2-5 was chosen for further analysis. The serum source
of this cDNA clone was EGM human PT-NANBH serum (Example 1).

The insert of 108-2-5 was isolated by EcoRI digestion
of the lambda gt10 clone, electrophoretic fractionation,
and electroelution (Example 6). The isolated insert was
treated with DNase I to generate random fragments (Example
6), and the resulting digest fragments were inserted into
lambda gt11 phage vectors for immunoscreening. The lambda
gt11 clones of the 108-2-5 fragments were immunoscreened
(Example 6) using human (BV and normal) and chimpanzee #771
serum. Twelve positive clones were identified by first
round immunoscreening with the human and chimp sera. Seven
of the 12 clones were plaque purified and rescreened using
chimpanzee #771 serum. Partial DNA sequences of the insert
DNA were determined for two of the resulting clones,
designated 328-16-1 and 328-16-2. These two clones
contained sequences essentially identical to clone 40
(Example 6).

The clone 36 insert can be used in a similar manner to
probe the original cDNA library generated in lambda gt10.
Specific subfragments of clone 36 may be isolated by
polymerase chain reaction or after cleavage with
restriction endonucleases. These fragments can be
radioactively labelled and used as probes against the cDNA
libraries generated in lambda gt10 (Example 1C). In
particular, the 5' terminal sequences of the clone 36
insert are useful as probes to identify clones overlapping
this region.

Further, the sequences provided by the terminal clone
36 insert sequences and the terminal clone 40 insert
sequences are useful as specific sequence primers in
first-strand DNA synthesis reactions (Maniatis et al.;
Scharf et al.) using, for example, chimpanzee #771 sera
generated RNA as substrate. Synthesis of the second strand
of the cDNA is randomly primed. The above procedures
identify or produce cDNA molecules corresponding to nucleic
acid regions that are 5' adjacent to the known clone 36 and
40 insert sequences. These newly isolated sequences can in
turn be used to identify further flanking sequences, and so
on, to identify the sequences composing the HCV genome. As
described above, after new HCV sequences are isolated, the
polynucleotides can be cloned and immunoscreened to
identify specific sequences encoding HCV antigens.

IV. Generating Overlapping Cloned Linking Fragments 
This section describes a method for producing and
identifying HCV peptides which may be useful as
HCV-diagnostic antigens. The present method is used to
generate a series of overlapping linking fragments which
span a segment of nucleic acid. The application of the
method to generating a series of overlapping linking
fragments which span a 7,300 basepair segment of the HCV
genome, whose sequence is given in the Appendix, will be
described with reference to Figures 1 and 2.

As a first step in the method, and with reference to
Figure 1, the nucleic acid of interest is obtained in
double-strand DNA form. Typically, this is done by
isolating genomic DNA fragments or by producing cDNAs from
RNA species present in a sample fluid. The latter method
is used to generate double-strand DNA from NANBH viral RNA
present in serum from chimpanzees or humans with known
PT-NANBH infection. Here RNA in the sample is isolated,
e.g., by guanidinium thiocyanate extraction of PEG
precipitated virions, and reacted with a suitable primer
for first strand cDNA synthesis.

First-strand cDNA priming may be by random primers,
oligo dT primers, or sequence-specific primer(s). The
primer conditions are selected to (a) optimize generation
of cDNA fragments which collectively will span the nucleic
acid segment of interest, and (b) produce cDNA fragments
which are preferably equal to or greater than about 1,000
basepairs in length. In one method applied to HCV RNA, the
first-strand synthesis is carried out using
sequence-specific primers which are complementary to spaced
regions along the length of the known HCV genomic sequence.
The primer positions are indicated at A, B, C, and D in
Figure 2, which shows a map of the HCV genome segment. The
basepair locations of the primers in the HCV genome are
given in Example 7 below. Following first strand synthesis,
the second cDNA strand is synthesized by standard
methods.

The linking fragments in the method are produced by
sequence-specific amplification of the double-strand DNA
obtained as above, using pairs of overlap-region primers to
be described. According to an important advantage of the
methods of the present invention, it is possible to
generate linking fragments even when the amount of
double-strand DNA is too low for direct sequence-specific
amplification. This limitation was found, for example, with
HCV cDNAs produced from NANBH-infected serum. Here the
double-stranded DNA available for amplification is first
amplified nonspecifically by a technique known as
Sequence-Independent Single-Primer Amplification (SISPA).

The SISPA technique is detailed in co-owned U.S.
Patent application for "RNA and DNA Amplification
Techniques", Serial No. 224,961, filed July 26, 1988. The
method as applied to amplification of HCV cDNA fragments is
also described in Example 7. Briefly, known-sequence
linker primers are attached to opposite ends of
double-stranded DNA in a DNA sample. These linkers then
provide the common end sequences for primer-initiated
amplification, using primers complementary to the
linker/primer sequences. Typically, the SISPA method is
carried out for 20-30 cycles of amplification, using
thermal cycling to achieve successive denaturation and
primer-initiated polymerization of second strand DNA.
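
The principle of the linker step can be sketched in a few
lines: once the same known-sequence linker is joined to both
ends of a duplex, each strand carries the linker complement
at its 3' end, so a single primer initiates synthesis on
both strands. The sketch below is illustrative only; the
linker sequence and all function names are hypothetical, and
the fold-amplification figure is a theoretical upper bound
that assumes perfect doubling in every cycle.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def add_linkers(top_strand: str, linker: str):
    """Return both strands (written 5'->3') of a duplex after the same
    linker has been joined to each end."""
    top = linker + top_strand.upper() + revcomp(linker)
    bottom = revcomp(top)
    return top, bottom


def primer_anneals_at_3prime(strand: str, primer: str) -> bool:
    """True if the primer is complementary to the 3' end of the strand."""
    return strand.endswith(revcomp(primer))


def max_fold_amplification(cycles: int) -> int:
    """Theoretical upper bound on amplification, assuming perfect doubling."""
    return 2 ** cycles


if __name__ == "__main__":
    LINKER = "GGAATTCGCGGCCGCTCG"  # hypothetical linker, not from the source
    top, bottom = add_linkers("ATGCCGTTAGC", LINKER)
    # One primer (the linker sequence itself) primes both strands.
    print(primer_anneals_at_3prime(top, LINKER),
          primer_anneals_at_3prime(bottom, LINKER))
    print(max_fold_amplification(20), max_fold_amplification(30))
```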

Figure 1 illustrates the SISPA amplification of duplex
DNA, to form amplified fragments which have known-sequence
regions P_i. As seen, the fragment mixture includes at
least some fragments which (a) overlap at regions P_i with
other fragments in the mixture and (b) contain complete
linking regions between adjacent P_i and P_i+1 regions.
Collectively, each linking region bounded by the associated
overlap regions making up the segment is present in at
least one DNA fragment.

The production of overlapping linking fragments, in
accordance with the methods of the present invention, is
carried out using the polymerase chain reaction (PCR)
method described in U.S. Patent No. 4,683,195. In
practicing this step of the method, first the total segment
of interest is divided into a series of overlapping
intervals bounded by regions of known sequence, as just
described. In Figure 2, the 7,300 basepair segment of the
HCV genome has been divided into 10 intervals, each about
500-1,000 basepairs in length. The intervals are designated
according to the forward F_i and reverse R_j primers used
in amplifying the sequence, as will be described. The
selection of the intervals is guided by (a) the requirement
that the basepair sequence at each end of the interval be
known, and (b) a preferred interval length of between about
500 and 2,000 basepairs.

In the method applied to the 7,300 basepair segment of
the HCV genome, the regions of overlap between the ten
intervals were additionally amplified, to verify that the
SISPA-amplified cDNA sample contained sufficient HCV cDNA
to observe PCR-amplification of HCV linking fragments, and
that HCV regions along the entire length of the genome were
available for amplification. Each overlap region in the
segment can be defined by a pair of primers which includes
a forward primer F_i and a reverse primer R_i which are
complementary to opposite strands of opposite ends of the
overlap region. The primers are typically about 20
basepairs in length and span an overlap region of about 200
basepairs. The eleven overlap regions in the HCV segment
and the regions corresponding to the forward and reverse
primers in each region are given in Example 8.

The primers are added to the amplified DNA
material in a PCR reaction mix, and the overlap region
bounded by the primers is amplified by 20-30 thermal
cycles. The reaction material is then fractionated, e.g.,
by agarose gel electrophoresis, and probed for the presence
of the desired sequence, e.g., by Southern blotting
(Southern), using a radiolabeled oligonucleotide probe
which is specific for an internal portion of the overlap
region. As described in Example 8, this method was
successful in producing amplified fragments for each of the
eleven F_i/R_i overlap regions in the HCV genome segment.
The overlap-region fragments may be used as probes for the
corresponding (two) linking fragments connected by the
overlap region. It is emphasized, however, that this
amplification step was employed to confirm the presence of
amplifiable cDNA along the length of the HCV genome, and
not as an essential step in producing the desired linking
fragments. The step is omitted from Figure 1.

The linking fragments Fi/Rj are produced by a two-primer
PCR procedure in which the SISPA-amplified DNA fragments
are amplified by a primer pair consisting of the forward
primer Fi of one overlap region and the reverse primer Rj of
an adjacent overlap region. The ten overlap regions in the
HCV segment and the regions corresponding to the forward
and reverse primers in each region are given in Example 9.
Typical amplification conditions are given in Example 9.
The amplified fragments in each reaction mixture are
isolated and purified, e.g., by gel electrophoresis, to
confirm the expected fragment size. Southern blots may be
probed with oligonucleotide probes complementary to
internal regions located between the fragment ends, to
confirm the expected sequence of the fragments. As shown
at the bottom in Figure 1, the method generates the
complete set of linking fragments, where each fragment is
bounded by overlap regions Pi and Pi+1.
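
By way of illustration only, the primer pairing just described can be
sketched as follows. The region labels and coordinates are hypothetical
(evenly spaced 200-basepair overlap regions), chosen only to show that
pairing the forward primer of each overlap region with the reverse
primer of the next yields one fewer linking fragment than there are
overlap regions, with adjacent fragments sharing one overlap region.
The actual primer positions are those given in Examples 8 and 9.

```python
from typing import List, NamedTuple


class OverlapRegion(NamedTuple):
    label: str   # overlap-region index (hypothetical numbering)
    start: int   # position of the forward primer (hypothetical coordinate)
    end: int     # position of the reverse primer (hypothetical coordinate)


class LinkingFragment(NamedTuple):
    forward: str
    reverse: str
    start: int
    end: int


def linking_fragments(regions: List[OverlapRegion]) -> List[LinkingFragment]:
    """Pair the forward primer of each region with the reverse primer of the next."""
    ordered = sorted(regions, key=lambda r: r.start)
    return [
        LinkingFragment(f"F{a.label}", f"R{b.label}", a.start, b.end)
        for a, b in zip(ordered, ordered[1:])
    ]


# Hypothetical layout: eleven 200-basepair overlap regions, 700 basepairs apart.
regions = [OverlapRegion(str(i + 1), 100 + 700 * i, 300 + 700 * i) for i in range(11)]
fragments = linking_fragments(regions)

for frag in fragments:
    print(f"{frag.forward}/{frag.reverse}: {frag.start}-{frag.end}")
```

Under these assumed coordinates, eleven overlap regions give ten
linking fragments, and each pair of neighboring fragments overlaps by
the full length of the shared overlap region.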

The method, as applied to generating ten overlapping
linking fragments of the 7,300 basepair HCV genome, is
described in Example 9. As demonstrated by size criteria
on gel electrophoresis and by sequence criteria by Southern
blotting, the method was successful in generating all ten
of the overlapping fragments spanning the HCV genome.

It will be appreciated that the above flanking
sequence amplification method can be applied to the
generation of DNA fragments corresponding to the insert
sequences of clones 36 and 40, which have also been
obtained by immunoscreening. The linker primers flanking
the inserts are easily used to generate sequences
corresponding to the clone inserts. For example, two-primer
amplification of the SISPA-amplified cDNA fragments
(Example 7) using the F12/R9 primer pair (the sequences of which
are given in Example 8) is carried out under conditions
similar to those described in Example 9. The amplified
fragment mixture is fractionated by electrophoresis
on 1.0% agarose, and the expected band is cut from the gel
and eluted.

The purified amplified fragment is treated with the
Klenow fragment of DNA polymerase I to ensure that the molecules
are blunt-ended. The fragment is then ligated to EcoRI
linkers (Example 10). The mixture is digested with EcoRI
and inserted into the lambda gt11 vector. The resulting
clones contain the entire coding sequences of either the
clone 36 or clone 40 inserts.

Alternatively, the original amplified 36/40 fragment
(primers F12/R9) is briefly treated with Exonuclease III
(Boehringer Mannheim, as per manufacturer's instructions)
to generate a family of fragments with different 5' ends.
The digestion products are treated as above and ligated
into the lambda gt11 vector. The resulting plaques are
then immunoscreened.

Further, different sets of primers, other than the
F12/R9 primers described above, can be used to directly
generate sequences encoding all, or portions, of clones 36
and 40. For example, primers F8/R9 can generate a fragment
corresponding to a portion of the 3' sequences of the
insert of clone 36 (Figure 4) and all of the insert
sequences of clone 40 (Figure 3). Also, primers F7/R8 can
be used to directly generate a fragment corresponding to a
portion of the 5' sequences present in the insert of clone
36 (Figure 4).

V. PT-NANBH Immunoreactive Peptide Fragments 

Several novel peptide antigens which are immunoreactive
with human and chimpanzee NANBH-infected
sera have been generated from the NANBH linking fragments
produced above, in accordance with the methods of the
present invention. Further, this method has confirmed
antigenic regions previously identified by cDNA library
immunoscreening (Section II above). The antigen peptides
derived from linking fragments are preferably produced in
a method which involves first digesting each of the above
linking fragments with DNase I under partial digestion
conditions, yielding DNA digest fragments predominantly in
the 100-300 basepair size range, as illustrated in Example
10. The digest fragments may be size fractionated, for
example by gel electrophoresis, to select those in the
desired size range.

The digest fragments from each linking fragment are
then inserted into a suitable expression vector. One
exemplary expression vector is lambda gt11, the advantages
of which have been described above.

For insertion into the expression vector, the digest
fragments may be modified, if needed, to contain selected
restriction-site linkers, such as EcoRI linkers, according
to conventional procedures. Typically, the digest
fragments are blunt-ended, ligated with EcoRI linkers, and
introduced into EcoRI-cut lambda gt11. Such recombinant
techniques are well known in the art (e.g., Maniatis et
al.).

The resulting viral genomic library may be checked to
confirm that a relatively large (representative) library
has been produced for each linking fragment. This can be
done, in the case of the lambda gt11 vector, by infecting
a suitable bacterial host, plating the bacteria, and
examining the plaques for loss of beta-galactosidase activity,
as evidenced by clear plaques.

The presence of a digest-fragment insert in the clear
plaques can be confirmed by amplifying the phage DNA, using
primers specific for the regions of the gt11 phage flanking
the EcoRI insert site, as described in Example 10B. The
results in Table 3 show that a large percentage of the
plaques tested in each linking fragment library contained
a digest-fragment insert.

The linking-fragment libraries may also be screened
for peptide antigens which are immunoreactive with human or
chimpanzee sera identified with PT-NANBH chronic,
convalescent, or acute infection. One preferred immunoscreening
method is described in Example 10B. Here recombinant
protein produced by the phage-infected bacteria is transferred
from the plaques to the filter. After washing, the filter
is incubated with test serum, and then reacted with
reporter-labeled anti-human IgG antibody. The filter is then
assayed for the presence of the reporter, which indicates the
presence of the peptide antigen. As seen from Table 3, several of
the linking-fragment libraries were positive for
immunoreactive peptides in the primary screen.

The immunoscreening method just described can be used
to identify library plaques from each of the linking
libraries which are immunoreactive with sera from humans or
chimpanzees with known chronic, convalescent, or acute PT-NANBH
infection. One exemplary screening procedure is given in
Example 11, where the ten HCV linking-fragment libraries
are screened with known PT-NANBH (a) human chronic serum,
(b) chimpanzee acute pooled sera and (c) chimpanzee chronic
pooled sera. Of the ten libraries examined, only the F9/R10
library did not give positive immunoreaction with any of
the three sera. Several of the fragment libraries,
including F3/R4, F6/R12, F12/R7, and F7/R8, showed five or more
positive reactions with chimpanzee acute sera, indicating
that these libraries each express one or more peptide
antigens which are useful for detecting chimpanzee or human
acute PT-NANBH infection.

The fragment library F7/R8 corresponds to an internal
fragment of clone 36 insert (Section II; Figure 4).
Accordingly, the linking fragment method confirmed that this DNA
region encodes a useful antigen. Further, the fragment
library F8/R9 contains the sequences present in the clone 40
insert (Section II; Figures 3 and 4). The results in Table
4 indicate that at least one peptide antigen effective to
detect the presence of chronic-infection serum was isolated
from the F8/R9 fragment library.

VI. Immunoreactive 409-1-1 Peptides 
A. Immunoreactive Screening

Two of the immunoreactive plaques identified by
immunoreactive screening, designated 409-1-1(abc) and
409-1-1(c-a), were tested for immunoreactivity against
well-documented PT-NANBH chronic sera which showed strong
immunoreactivity to the 5-1-1 HCV peptide antigen (Kuo).
The 5-1-1 HCV peptide antigen has previously been
identified as immunoreactive against a high percentage of human
PT-NANBH chronic sera. The 5-1-1 antigen is encoded by the
sequence between basepairs 3731 and 3857 in the HCV genome
(Appendix) and is itself contained in a larger peptide
antigen, C-100, encoded by the sequence between basepairs 3531
and 4442. The latter peptide is employed in a commercial
diagnostic kit for detection of human HCV infection
(Ortho/Chiron). The kit is reported to react positively
with about 80% of human chronic PT-NANBH samples, and about
15% of human acute PT-NANBH sera, as noted above.

WO 91/15516 PCT/US91/02370

The 409-1-1(c-a) phage was identified by
immunoscreening and plaque purified, as outlined above. A
related clone, designated 409-1-1(abc), was described in
the parent to the present application (U.S. Application
Ser. No. 07/505,611, herein incorporated by reference).
Clone 409-1-1(abc) was designated 409-1-1 in the parent
application. The a, b and c designations refer to three
regions of the 409-1-1(abc) sequence (see Figure 5). The
5-1-1 coding sequence was isolated by polymerase chain
reaction using oligonucleotide primers complementary to the
ends of the 5-1-1 coding region, and cloned into lambda
gt11 for expression under induction conditions of a fused
beta-galactosidase protein which includes the 5-1-1 antigen
peptide region. The 5-1-1 phage was identified and plaque
purified by similar methods.

The 409-1-1(c-a) and 5-1-1 antigens were compared by
plaque immunoscreening with a panel of 28 sera from human
normal (2 donors), human PT-NANBH-chronic (6 donors), chimpanzee
normal (7 donors), chimpanzee PT-NANBH-acute (5 donors),
and chimpanzee PT-NANBH-chronic (8 donors) sources, with the
results shown in Table 5 in Example 12. As can be seen in
Table 5, the 5-1-1 and 409-1-1(c-a) peptides reacted with
most of the human and chimpanzee chronic sera, although the
409-1-1(c-a) peptide detected a higher percentage of human
chronic sera samples (83% vs 66%). The chronic human serum
which was detected by the 409-1-1(c-a) peptide, but not by
5-1-1, was from a patient (BV) who died of fulminant NANBH
infection. Because the 5-1-1 antigen is contained within
the C-100 antigen in the commercially available kit format
(Ortho/Chiron), it was of interest to determine whether the
C-100 antigen gave a broader range of reactivity with the
test sera. The results are shown at the right in Table 5
below. The only human NANBH serum that was tested was the
above BV serum which was not detected by 5-1-1. This serum
was also not immunoreactive with the C-100 antigen (0/1).
Nor was the C-100 antigen reactive with any of the five
acute chimpanzee sera which were tested (0/5). It is also noted
that the 409-1-1(c-a) antigen is immunoreactive with 3 of
the 5 acute chimpanzee sera tested, compared with only 1
out of 5 for the 5-1-1 antigen. The results indicate that
the 409-1-1(c-a) antigen has broader immunospecificity with
PT-NANBH sera, and thus would provide a superior diagnostic
agent. The results obtained with 409-1-1(c-a) are
comparable to the results obtained using 409-1-1(abc).
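
The comparison above can be restated as a simple tabulation, offered
here only as an illustrative sketch. The acute chimpanzee counts (3 of
5, 1 of 5, 0 of 5) are taken directly from the text. The human chronic
counts (5 of 6 and 4 of 6) are an assumption, inferred from the quoted
percentages and the six-donor panel; Table 5 remains the authoritative
source. Percentages are truncated to whole numbers, which appears to be
how the quoted 66% figure was obtained.

```python
# Panel composition as stated in the text (number of donors per group).
panel = {
    "human normal": 2,
    "human PT-NANBH chronic": 6,
    "chimpanzee normal": 7,
    "chimpanzee PT-NANBH acute": 5,
    "chimpanzee PT-NANBH chronic": 8,
}

# (reactive sera, sera tested) for each antigen.
# Human chronic counts are inferred from the quoted percentages (assumption).
results = {
    "409-1-1(c-a)": {"human chronic": (5, 6), "chimpanzee acute": (3, 5)},
    "5-1-1": {"human chronic": (4, 6), "chimpanzee acute": (1, 5)},
    "C-100": {"chimpanzee acute": (0, 5)},
}


def percent_reactive(positive: int, tested: int) -> int:
    """Percentage of reactive sera, truncated to a whole number."""
    return (100 * positive) // tested


for antigen, groups in results.items():
    for group, (positive, tested) in groups.items():
        pct = percent_reactive(positive, tested)
        print(f"{antigen:14s} {group:18s} {positive}/{tested} ({pct}%)")
```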

It is noted here that the 409-1-1 (abc) coding sequence 
is contained in the F./P, linking fragment and does not 
overlap the sequence of the C-100 (and 5-1-1) coding region
which is in the F4/R5 and F5/R6 linking fragments. The
relatively long coding sequence of the 409-1-1(abc) peptide
illustrates that larger size digest fragments (substantially
greater than 300 basepairs) are generated in the partial
digest step used in producing digest fragments for antigen
expression.

The 409-1-1(abc) peptide, which forms one aspect of
the invention, has the amino acid sequence which is
presented as SEQ ID NO: 10. The DNA coding sequence
corresponding to the insert in the 409-1-1 clone is given
in Figure 5 and is presented as SEQ ID NO: 9.

The 409-1-1(c-a) peptide, which forms another aspect
of the invention, has the amino acid sequence presented as
SEQ ID NO: 8. The DNA coding sequence corresponding to the
insert in the 409-1-1(c-a) clone is given in Figure 6 and
is presented as SEQ ID NO: 7. The relationship between the
coding sequence of 409-1-1(c-a) and 409-1-1(abc) is
outlined in Example 12. Briefly, 409-1-1(c-a) consists of
a carboxy terminal region of 409-1-1(abc) moved to the
amino terminus of the 409-1-1 coding sequence, with a
truncation of the remaining 3' 409-1-1(abc) coding
sequence.

More generally, the invention includes a peptide
antigen which is immunoreactive with sera from humans with
HCV infection. Such peptide antigens are readily
identifiable by the methods of the present invention.

Antigens obtained from the region corresponding to the
HCV sequences encoding the 409-1-1 antigens were further
characterized as follows. The primers shown in Table 7 were used to
generate a family of overlapping amplified fragments
derived from this region. Several templates were used for
the DNA amplification reactions (Table 8). The
relationships of the coding sequences of the resulting clones to
each other are graphically illustrated in Figure 7. The
amplified fragments were then cloned into lambda gt11
vectors (Example 13).

These cloned fragments were then immunoscreened
(Example 13). Seven of the nine clones tested positive by
preliminary immunoscreening (Table 9). These seven clones
were then tested against a more extensive battery of
PT-NANBH serum samples, including numerous human clinical
samples. The sensitivity of the antigens, in decreasing
order, for reactivity with the serum used for screening was
as follows: 33cu > 33c > 409-1-1(c-a) > 409-1-1-F1R2 >
409-1-1(abc) = 409-1-1a > 5-1-1 > 409-1-1(c+270). As can
be seen from these results, all of the alternative clones,
with the exception of 409-1-1(c+270), provided a more
sensitive antigen than 5-1-1. However, although 33cu and
33c were very sensitive antigens, in this assay they
reacted slightly with serum which was known to be negative
for HCV and may therefore be less specific. Accordingly,
the 409-1-1 series appears preferable for use as diagnostic
antigens since they are more specific to HCV-induced
antibodies.

The immunoscreening was extended to include the clone
36 and 45 encoded epitopes: the insert of clone 45 is
essentially the same as the insert of clone 40 (Example 4).
As can be seen from the results presented in Table 11, the
antigens produced by clones 36 and 40, while not as
sensitive as 409-1-1(c-a), do yield HCV-specific
immunopositive signals with selected samples. Accordingly, the two
methods presented in the present invention, (i)
immunoscreening of cDNA libraries generated directly from
sera-derived RNA, and (ii) immunoscreening of amplified-fragment
libraries, can both be seen to be effective methods of
identifying cDNA sequences encoding viral antigens.
Further, confirmation of the clone 36 and 40 encoded
antigens by identification of antigens corresponding to
these HCV regions using the amplified-fragment library
method validates the usefulness of the amplified-fragment
method.

B. Peptide Purification 

The recombinant peptides of the present invention can
be purified by standard protein purification procedures
which may include differential precipitation, molecular
sieve chromatography, ion-exchange chromatography,
isoelectric focusing, gel electrophoresis and affinity
chromatography. In the case of a fused protein, such as the
beta-galactosidase fused proteins prepared as above, the fused
protein can be isolated readily by affinity chromatography,
by passing cell lysis material over a solid support having
surface-bound anti-beta-galactosidase antibody. For
example, purification of a beta-galactosidase fusion
protein, derived from 409-1-1(c-a) coding sequences, by
affinity chromatography is described in Example 14.

A fused protein containing the 409-1-1(a) peptide
fused with glutathione-S-transferase (Sj26) protein has
also been expressed using the pGEX vector system in E. coli
KM392 cells (Smith). This expression system has the
advantage that the fused protein is generally soluble and
therefore can be isolated under non-denaturing conditions.
The fused Sj26 protein can be isolated readily by
glutathione substrate affinity chromatography (Smith). This method
of expressing this fusion protein is given in Example 15
and is applicable to any of the other antigen coding
sequences described by the present invention.

Also included in the invention is an expression
vector, such as the lambda gt11 or pGEX vectors described
above, containing the 409-1-1(a) coding sequence and
expression control elements which allow expression of the
coding region in a suitable host. The coding sequence is
contained in the sequence given above corresponding to
basepairs 2755-3331 of the HCV genome. The control
elements generally include a promoter, translation
initiation codon, and translation and transcription termination
sequences, and an insertion site for introducing the insert
into the vector. In the case of the two vectors
illustrated in Example 15, the control elements control the
synthesis of the protein which is fused with the
heterologous peptide antigen. Such expression vectors can be
readily constructed for the other antigen coding sequences
described by the present invention.

The lambda gt11 vectors containing the following
coding regions have been deposited with The American Type
Culture Collection, 12301 Parklawn Dr., Rockville MD,
20852: the 409-1-1(abc) coding region, designated
gt11/409-1-1(abc), ATCC No. 40876; the 409-1-1(c-a) coding
region, designated gt11/409-1-1(c-a), ATCC No. 40792; clone
36, designated gt11/36, ATCC No. 40901; and clone 40,
designated gt11/40, ATCC No. 40893.

VII. Immunoreactive Clones of the HCV-Capsid Antigen

At the 1990 Congress of Hepatology a region of the
full-length HCV nucleic acid sequence was presented,
nucleotide residues 325-970, containing the HCV non-coding,
structural core protein and envelope protein coding
sequences as capsid parts of a polyprotein sequence. During the
course of experiments performed in support of the present
invention, the coding region that corresponds to the capsid
protein was more clearly defined.

Polymerase Chain Reaction primers were constructed
from selected HCV sequence which would generate
amplification products of nucleotides 325-970 of the full length HCV
genome (see Appendix). These primers, SF2(C) and SR1(C),
are presented in Example 16. The primers contained
non-complementary sequences which encoded restriction enzyme
cleavage sites to facilitate subsequent cloning
manipulations. The primers were used in amplification reactions
containing SISPA-amplified HCV cDNA molecules (Example 7)
as substrate. The resulting amplification products were
cloned into the pGEX and pET vectors (Example 16). The
pGEX vector allows expression of inserted coding sequences
as fusion proteins to the Sj26 protein,
glutathione-S-transferase. Insertion into the pET vector allows
expression of the inserted coding sequences independent of fusion
sequences.

These clones were then immunologically screened using
sera known to be reactive with HCV-antigens (Example 17).
Several clones in both vectors were identified which were
immunoreactive with the anti-HCV sera (in pGEX, clones 14,
15, 56, 60, and 65, Example 17, Table 13). It was observed
that the fusion proteins which were produced from the
clones in pGEX were smaller than expected.

Clone 15 was selected for scaled up production of the
Sj26/HCV-antigen fusion protein. The fusion protein
product (approximately 29 kd) was smaller than the expected
fusion product (approximately 50 kd, Example 17). Further,
the yield of the fusion protein from this preparation was
unexpectedly low.

Clones 15 and 56 were chosen for nucleic acid
sequencing of the HCV-antigen containing inserts (Example 18).
The sequences of the two clones were very similar with the
exception that clone 15 had a termination codon starting at
nucleotide position 126. This result suggested that the
amino terminal 42 amino acids encoded by the HCV insert
were immunoreactive with the anti-HCV sera used for
immunoscreening.
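
The fusion protein sizes noted above can be checked with a rough
estimate, sketched below. Two figures in the sketch are assumptions not
stated in the text: an average mass of about 110 daltons per amino acid
residue, and a mass of about 26 kd for the Sj26 fusion partner. The
sketch also assumes that the entire 325-970 amplification product is
translated in frame with Sj26. Under these assumptions the full-length
product is close to the expected 50 kd, and a product truncated after
42 HCV-encoded residues is close to the observed size of about 29 kd.

```python
AVERAGE_RESIDUE_MASS_KD = 0.110  # assumption: ~110 daltons per residue
SJ26_MASS_KD = 26.0              # assumption: approximate mass of Sj26 alone


def insert_residues(first_nt: int, last_nt: int) -> int:
    """Number of complete codons in an insert spanning first_nt..last_nt inclusive."""
    return (last_nt - first_nt + 1) // 3


def fusion_mass_kd(hcv_residues: int) -> float:
    """Estimated mass of an Sj26 fusion carrying the given number of HCV residues."""
    return SJ26_MASS_KD + hcv_residues * AVERAGE_RESIDUE_MASS_KD


full_length_kd = fusion_mass_kd(insert_residues(325, 970))
truncated_kd = fusion_mass_kd(42)  # clone 15: termination after 42 residues

print(f"full-length fusion: about {full_length_kd:.1f} kd")
print(f"truncated fusion:   about {truncated_kd:.1f} kd")
```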

To test the suggestion that the amino terminus of the
HCV polyprotein was antigenic, a synthetic oligopeptide was
constructed essentially corresponding to amino acid
residues 6-24 of Figure 8A: this peptide had very strong
immunoreactivity with anti-HCV sera as tested by ELISA.
PCR primers (Figure 8, C1 and NC105) were designed to
generate a clone corresponding to this region (Figure 10,
C1NC105, SEQ ID NO: 25). Three other synthetic peptides
were tested, one of which was strongly immunoreactive with
anti-HCV sera (amino acid residues 47-74, Figure 8A) and
two which were weakly immunoreactive (amino acid residues
39-60 and 101-121, Figure 8A). These synthetic peptides
confirm the presence of a strong antigenic region at the
amino-terminal end of the HCV polyprotein in the capsid
protein region.

The sequence of clone 56, designated pGEX-GG1-56, is
shown in Figure 8A and is presented in the sequence listing
as SEQ ID NO: 11. The sequence shows that the clone has a
long, open reading frame. When production of the fusion
protein was induced, a fusion protein smaller than the
expected product was produced, similar in size to the clone
15 product. The nucleotide sequence of the clones revealed
a region which is prone to translational frameshifting,
AAAAAAAAAA (Atkins et al., Wilson et al.). Such a
nucleotide sequence may contribute to the low protein yields when
these clones are expressed in E. coli. In an effort to
improve the level of fusion protein expression, the third
nucleotide position of several codons through this region
was changed to a G, resulting in the sequence AGAAGAAGAA
(Example 20); the changes had no effect on the protein
coding sequence (amino acid residues 8-10, Figure 8A).
This modified insert was cloned into the pGEX vector and
the resulting plasmid named pGEX-CapA.
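
That third-position changes of this kind can leave the encoded protein
unchanged is illustrated by the sketch below, which translates a short
sequence before and after substitution using the standard genetic code.
The flanking C nucleotides and the reading frame are assumptions,
chosen so that the three G substitutions fall on third codon positions
of three consecutive codons; the actual sequence context is that shown
in Figure 8A.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons of a DNA sequence, reading from position 0."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Assumed context: the ten-A run flanked by C on each side, frame starting at the first C.
original = "CAAAAAAAAAAC"   # contains AAAAAAAAAA
modified = "CAGAAGAAGAAC"   # contains AGAAGAAGAA

print(translate(original))
print(translate(modified))
print(translate(original) == translate(modified))
```

In this assumed frame the substitutions convert CAA to CAG and AAA to
AAG, each of which is a synonymous change, while interrupting the run
of consecutive A residues.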

A hydropathicity plot was generated for the protein
coding sequences of the insert of pGEX-GG1 (Example 19,
Figure 9). The results of this analysis indicated that the
carboxy-terminal region of the encoded protein,
approximately amino acid residues 168-182, had the potential for
being a membrane spanning segment. Since it was unlikely
that the membrane spanning segment would provide a strong
antigen and since overproduction of proteins with these
regions can adversely affect the growth of bacterial cells,
a series of carboxy terminal deletions were generated from
pGEX-CapA (Example 20).
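
A hydropathicity analysis of the kind referred to above can be sketched
as a sliding-window average over a per-residue hydropathy scale. The
sketch below uses the Kyte-Doolittle scale, a 15-residue window, and a
threshold of 1.6; all three are assumptions made for illustration, since
the method actually used is that of Example 19. The test sequence is
hypothetical and is not an HCV sequence.

```python
from typing import List, Tuple

# Kyte-Doolittle hydropathy values (assumed scale; see Example 19 for the method used).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 15) -> List[float]:
    """Mean hydropathy of each full-length window along the sequence."""
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophobic_windows(sequence: str, window: int = 15,
                        threshold: float = 1.6) -> List[Tuple[int, int]]:
    """1-based (first, last) residue ranges whose mean hydropathy meets the threshold."""
    profile = hydropathy_profile(sequence, window)
    return [(i + 1, i + window) for i, mean in enumerate(profile) if mean >= threshold]


# Hypothetical sequence: a 15-residue hydrophobic stretch flanked by charged residues.
example = "KKRRDDEE" + "LLVVIIAALLVVIIA" + "KKRRDDEE"
print(hydrophobic_windows(example))
```

Windows that meet the threshold mark candidate membrane spanning
segments; in the hypothetical sequence they cluster around residues 9-23.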

To generate the carboxy terminal deletions, PCR primers
were designed to be complementary to various regions of the
pGEX-CapA insert encoded protein. The primers used to
generate the carboxy terminal deletions are given in Table
14 and the location of the primers relative to the insert
coding sequence is presented in Figure 8B. The carboxy
terminal deletion fragments were cloned into the pGEX
vector and Sj26/HCV-insert fusion proteins were produced.
These fusion proteins were then screened with anti-HCV sera
and an epitope map generated for the immunoreactive
polypeptides (see Figure 10). Clones C1NC270, C1NC360, and
C1NC450 all expressed high levels of the Sj26/HCV fusion
proteins. Further, these fusion proteins all corresponded
to the size predicted from their nucleic acid coding
sequences. Clones C1NC520 and C1NC580 gave poor yields of
fusion proteins, suggesting that when the hydrophobic region
of amino acid residues 168-182 is present it may in part be
responsible for the poor protein yields previously
obtained.

The deletion analysis was continued to further dissect
the antigenic regions of the pGEX-CapA encoded HCV antigen.
A series of amino terminal deletions (primers in Table 15)
combined with carboxy terminal deletions were generated
using PCR primers: the locations of all the primers are
illustrated in Figure 8B.

The results of the deletion analysis are presented in
Table 16 and in Figure 10. These results, combined with
the synthetic peptide data presented above, suggest that
the capsid protein (which comprises the N-terminus of the
HCV polyprotein) has two dominant immunoreactive regions.
Both of these immunoreactive regions are useful as
diagnostic antigens. The region comprising the first 35
amino acids spans one of the epitopes and the region
spanning residues 34-90 encompasses the other strongly
immunoreactive domain.

In summary, all of the pGEX clones containing the
N-terminus of the HCV polyprotein and either 34, 90, 120 or
150 residues produced large quantities of fusion protein
which was shown to be efficiently recognized by HCV
positive sera. The expression products of the PCR inserts encoding
amino acid residues 34-90 were also strongly immunoreactive,
whereas inserts encoding residues 90-120 or 90-150 were not
immunoreactive, demonstrating that these regions were not
recognized by human sera. This result suggests that the
region important for the production of recombinant
antigens is contained within residues 1 through 90.
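
The reasoning in the preceding paragraph can be expressed as a simple
interval calculation, sketched below for illustration. The residue
ranges are those stated in the text; treating a fragment as a closed
residue interval, and subtracting the residues covered by
non-reactive fragments from those covered by reactive fragments, are
simplifying assumptions of the sketch. Because residue 90 is a shared
boundary of reactive and non-reactive fragments, the sketch reports
residues 1-89, which agrees with the stated region of residues 1
through 90 to within that boundary residue.

```python
from typing import Iterable, Set, Tuple

Interval = Tuple[int, int]  # (first residue, last residue), inclusive

# Fragments described in the text, by reactivity with HCV positive sera.
reactive = [(1, 34), (1, 90), (1, 120), (1, 150), (34, 90)]
non_reactive = [(90, 120), (90, 150)]


def covered_residues(intervals: Iterable[Interval]) -> Set[int]:
    """All residue positions covered by at least one interval."""
    covered: Set[int] = set()
    for first, last in intervals:
        covered.update(range(first, last + 1))
    return covered


# Residues present in some reactive fragment but in no non-reactive fragment.
candidate = covered_residues(reactive) - covered_residues(non_reactive)
region = (min(candidate), max(candidate))

print(f"candidate antigenic region: residues {region[0]}-{region[1]}")
```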

Analyses of the pGEXC1NC450 protein and the pET360
protein showed that the inclusion of these antigens in
Western and ELISA formats permitted the identification of
HCV positive sera which had been previously identified as
either HCV negative or HCV indeterminate. Accordingly, the
inclusion of these epitopes permits the generation of an
improved screening system (Example 21).

VIII. Anti-HCV Antigen Antibodies

In another aspect, the invention includes antibodies
specific against the recombinant antigens of the present
invention. Typically, to prepare antibodies, a host
animal, such as a rabbit, is immunized with the purified
antigen or fused protein antigen. The host serum or plasma
is collected following an appropriate time interval, and
this serum is tested for antibodies specific against the
antigen. Example 15 describes the production of rabbit
serum antibodies which are specific against the 409-1-1
antigens in the Sj26/409-1-1(a) and
beta-galactosidase/409-1-1(c-a) fusion proteins. These techniques are equally
applicable to the other antigens of the present invention.

The gamma globulin fraction or the IgG antibodies of
immunized animals can be obtained, for example, by use of
saturated ammonium sulfate or DEAE Sephadex, or other
techniques known to those skilled in the art for producing
polyclonal antibodies.

Alternatively, the purified antigen or fused antigen
protein may be used for producing monoclonal antibodies.
Here the spleen or lymphocytes from an immunized animal are
removed and immortalized or used to prepare hybridomas by
methods known to those skilled in the art. To produce a
human-human hybridoma, a human lymphocyte donor is
selected. A donor known to be infected with HCV (where
infection has been shown, for example, by the presence of
anti-virus antibodies in the blood) may serve as a suitable
lymphocyte donor. Lymphocytes can be isolated from a
peripheral blood sample, or spleen cells may be used if the
donor is subject to splenectomy. Epstein-Barr virus (EBV)
can be used to immortalize human lymphocytes, or a human
fusion partner can be used to produce human-human
hybridomas. Primary in vitro immunization with peptides can also
be used in the generation of human monoclonal antibodies.

Antibodies secreted by the immortalized cells are
screened to determine the clones that secrete antibodies of
the desired specificity, for example, using the Western
blot method described in Example 15.

IX. Utility

A. Diagnostic Method and Kit 

The antigens obtained by the methods of the present
invention are advantageous for use as diagnostic agents for
anti-HCV antibodies present in HCV-infected sera;
particularly, the 409-1-1 antigens (409-1-1(abc), 409-1-1(c-a),
and related antigens (see Table 9)); the clone 36 antigen;
the clone 40 antigen; and the capsid antigen. As noted
above, many of the antigens provide the advantage over
known HCV antigen reagents 5-1-1 and C-100 in that they are
immunoreactive with a wider range of PT-NANBH infected
sera, particularly acute-infection sera. This is
particularly true of combinations of the 409-1-1 antigens with the
HCV-core protein antigens as described in Section VII
above. The antigens 409-1-1(c-a) and Cap450 have been
combined in an ELISA test kit and tested against HCV test
kits produced by Abbott and Ortho. The antigens of the
present invention consistently identify more HCV+ samples
with a high degree of specificity which is comparable to or
better than the Abbott and Ortho test kits.

In one preferred diagnostic configuration, test serum
is reacted with a solid phase reagent having a
surface-bound HCV antigen (or antigens) obtained by the methods of
the present invention, e.g., the 409-1-1(c-a) antigen and
the Cap450 antigen. After binding anti-HCV antibody to the
reagent and removing unbound serum components by washing,
the reagent is reacted with reporter-labeled anti-human
antibody to bind reporter to the reagent in proportion to
the amount of bound anti-PT-NANBH antibody on the solid
support. The reagent is again washed to remove unbound
labeled antibody, and the amount of reporter associated
with the reagent is determined. Typically, the reporter is
an enzyme which is detected by incubating the solid phase
in the presence of a suitable fluorometric or colorimetric
substrate.

The solid surface reagent in the above assay is
prepared by known techniques for attaching protein material
to solid support material, such as polymeric beads, dip
sticks, 96-well plates or filter material. These attachment
methods generally include non-specific adsorption of the
protein to the support or covalent attachment of the
protein, typically through a free amine group, to a
chemically reactive group on the solid support, such as an
activated carboxyl, hydroxyl, or aldehyde group.

In a second diagnostic configuration, known as a
homogeneous assay, antibody binding to a solid support
produces some change in the reaction medium which can be
directly detected in the medium. Known general types of
homogeneous assays proposed heretofore include (a)
spin-labeled reporters, where antibody binding to the antigen is
detected by a change in reporter mobility (broadening of
the spin splitting peaks), (b) fluorescent reporters, where
binding is detected by a change in fluorescence efficiency,
(c) enzyme reporters, where antibody binding affects
enzyme/substrate interactions, and (d) liposome-bound
reporters, where binding leads to liposome lysis and
release of encapsulated reporter. The adaptation of these
methods to the protein antigens of the present invention
follows conventional methods for preparing homogeneous
assay reagents.

In each of the assays described above, the assay
method involves reacting the serum from a test individual
with the protein antigen and examining the antigen for the
presence of bound antibody. The examination may involve
attaching a labeled anti-human antibody to the antibody
being examined, either IgM (acute phase) or IgG
(convalescent or chronic phase), and measuring the amount
of reporter bound to the solid support, as in the first
method, or may involve observing the effect of antibody
binding on a homogeneous assay reagent, as in the second
method.

Also forming part of the invention is an assay system
or kit for carrying out the assay method just described.
The kit generally includes a support with surface-bound
recombinant HCV antigen (e.g., the 409-1-1 antigens, etc.,
as above), and a reporter-labeled anti-human antibody for
detecting surface-bound anti-PT-NANBH-antigen antibody.

As discussed in Section III above, peptide antigens
associated with several of the linking-fragment libraries
are immunoreactive with acute NANBH sera from chimpanzees,
indicating that the peptides would be useful for detecting
acute NANBH infection in human serum. In particular, one
or more peptide antigens produced by the linking fragment
20 libraries, F 8 /R, (reactive with chronic sera) , T^, F 6 B n , 
FjjRj, F 7 Rg, or F 7 R g (which are shown in Example 11 to produce 
one or more peptide antigens which are immunoreactive with
acute chimpanzee sera) can be combined with the 409-1-1
antigens to provide a diagnostic composition capable of
immunoreacting with a high percentage of both chronic and
acute human NANBH serum samples. Further, as discussed in
Section VII above, inclusion of the HCV-capsid protein
antigens of the present invention adds an extra level of
sensitivity.

A third diagnostic configuration involves use of the
anti-HCV antibodies, described in Section VI above, capable
of detecting HCV specific antigens. The HCV antigens may
be detected, for example, using an antigen capture assay
where HCV antigens present in candidate serum samples are
reacted with an HCV specific monoclonal antibody. The
monoclonal antibody is bound to a solid substrate and the
antigen is then detected by a second, different labelled
anti-HCV antibody. The monoclonal antibodies of the
present invention which are directed against HCV specific
antigens are particularly suited to this diagnostic method.

B. Peptide Vaccine

The HCV antigens identified by the methods of the
present invention, e.g. 409-1-1 (c-a) and HCV-core protein
antigens, can be formulated for use in an HCV vaccine. The
vaccine can be formulated by standard methods, for example,
in a suitable diluent such as water, saline, buffered
salines, complete or incomplete adjuvants, and the like.
The immunogen is administered using standard techniques for
antibody induction, such as by subcutaneous administration
of physiologically compatible, sterile solutions containing
inactivated or attenuated virus particles or antigens. An
immune response producing amount of virus particles is
typically administered per vaccinizing injection, typically
in a volume of one milliliter or less.

A specific example of a vaccine composition includes,
in a pharmacologically acceptable adjuvant, a recombinant
409-1-1 (c-a) peptide. The vaccine is administered at
periodic intervals until a significant titer of anti-HCV
antibody is detected in the serum. Such vaccines can also
comprise combinations of the HCV antigens of the present
invention.

C. Passive Immunoprophylaxis

The anti-HCV antibodies of the invention can be used
as a means of enhancing an anti-HCV immune response since
antibody-virus complexes are recognized by macrophages and
other effector cells. The antibodies can be administered in
amounts similar to those used for other therapeutic
administrations of antibody. For example, pooled gamma
globulin is administered at 0.02-0.1 ml/lb body weight
during the early incubation of other viral diseases such as
rabies, measles and hepatitis B to interfere with viral
entry into cells. Thus, antibodies reactive with, for
example, the 409-1-1 (c-a) antigen can be passively
administered alone, in a "cocktail" with other anti-viral
antibodies, or in conjunction with another anti-viral agent
to a host infected with a PT-NANBH virus to enhance the
immune response and/or the effectiveness of an antiviral
drug.

The following examples illustrate various aspects of
the invention, but are in no way intended to limit the
scope thereof.

Materials 

E. coli DNA polymerase I (Klenow fragment) was
obtained from Boehringer Mannheim Biochemicals
(Indianapolis, IN). T4 DNA ligase and T4 DNA polymerase
were obtained from New England Biolabs (Beverly, MA).
Nitrocellulose filters were obtained from Schleicher and
Schuell (Keene, NH).

Synthetic oligonucleotide linkers and primers were
prepared using commercially available automated
oligonucleotide synthesizers. Alternatively, custom
designed synthetic oligonucleotides may be purchased, for
example, from Synthetic Genetics (San Diego, CA). cDNA
synthesis kit and random priming labeling kits were
obtained from Boehringer Mannheim Biochemicals (BMB,
Indianapolis, IN).

Example 1

Construction of NANB-containing cDNA libraries

A. Infection of a Chimpanzee with HCV 

A chimpanzee (#771) was inoculated with a Factor VIII
preparation which was known to cause parenterally
transmitted non-A non-B hepatitis (PT-NANBH) in human
patients treated with the Factor VIII concentrate
(Bradley). Post-infection ultrastructural changes in liver
tissue were observed by electron microscopy and ALT
(alanine amino transferase) elevation was observed in the
infected chimpanzee. These observations are consistent
with PT-NANBH infection.

B. Isolation of RNA from Sera 

Serum was collected from the above described infected
chimpanzee (#771) and four human PT-NANBH clinical sources
(EGM, BV, CC and WEH). Ten milliliters of each undiluted
serum was pelleted by centrifugation at 30K, for 3 hours in
an SW40 rotor, at 4°C. RNA was extracted from each
resulting serum pellet using the following modifications of
the hot phenol method of Feramisco et al. Briefly, for each
individual serum sample, the pellet was resuspended in 0.5
ml of 50 mM NaOAc, pH=4.8, containing 1% SDS. An equal
volume of 60°C phenol was added and incubated for 15 minutes
at 60°C with occasional vortexing. This mixture was
transferred to a 1.5 ml microfuge tube and spun for two
minutes at room temperature in a table top microfuge. The
aqueous phase was transferred to a new microfuge tube. To
the aqueous phase, 50 µl of 3 M NaOAc, pH=5.2, and two
volumes of 100% ethanol were added. This solution was held
at -70°C for approximately 10 minutes and then spun in a
microfuge at 4°C for 10 minutes. The resulting pellet was
resuspended in 100 µl of sterile glass distilled water. To
this solution 10 µl of NaOAc, pH=5.2, and two volumes of
100% ethanol were added. The solution was held at -70°C for
at least 10 minutes. The RNA pellet was recovered by
centrifugation in a microfuge at 12,000 X g for 15 minutes
at 5°C. The pellet was washed in 70% ethanol and dried
under vacuum.

C. Synthesis of cDNA 

(i) First Strand Synthesis 

The synthesis of cDNA molecules was accomplished as
follows. The above described RNA preparations were each
resuspended in 26 µl of sterile glass distilled water
(treated with diethyl pyrocarbonate, Maniatis et al.), 5 µl
of 10 X reaction buffer (0.5 M Tris HCl, pH=8.5; 0.4 M KCl;
0.1 M MgCl2; 4 mM DTT), 10 µl of a nucleotide solution
(dGTP, dATP, dTTP, and dCTP, each at a concentration of 5
mM), 5 µl random primer, 0.25 µl of 32P-dCTP, 2 µl AMV
reverse transcriptase, and 2 µl of RNASIN (Promega), in a
total reaction volume of 50 µl. This mixture was incubated
for one hour at 42°C.
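The composition of the first-strand reaction can be checked with simple dilution arithmetic. The following is a minimal sketch for illustration only, not part of the original protocol; it assumes the nominal 50 µl total volume stated in the text, and all variable and function names are hypothetical.

```python
# Illustrative check of the first-strand reaction described above.
# Volumes (in microliters) are taken from the text; names are hypothetical.
components_ul = {
    "RNA in treated water": 26.0,
    "10 X reaction buffer": 5.0,
    "nucleotide solution (5 mM each dNTP)": 10.0,
    "random primer": 5.0,
    "labeled dCTP": 0.25,
    "AMV reverse transcriptase": 2.0,
    "RNASIN": 2.0,
}

NOMINAL_TOTAL_UL = 50.0  # total reaction volume as stated in the text

# The listed volumes sum to slightly more than the nominal total,
# because the label contributes an extra 0.25 microliter.
summed_ul = sum(components_ul.values())


def final_conc(stock, volume_ul, total_ul=NOMINAL_TOTAL_UL):
    """Concentration after diluting `volume_ul` of stock into `total_ul`."""
    return stock * volume_ul / total_ul


# Each dNTP: 10 microliters of a 5 mM stock in 50 microliters.
dntp_final_mM = final_conc(5.0, 10.0)

# 10 X buffer components (mM), diluted tenfold by adding 5 microliters.
buffer_10x_mM = {"Tris HCl": 500.0, "KCl": 400.0, "MgCl2": 100.0, "DTT": 4.0}
buffer_final_mM = {name: final_conc(conc, 5.0)
                   for name, conc in buffer_10x_mM.items()}

if __name__ == "__main__":
    print(f"summed volume: {summed_ul} microliters")
    print(f"each dNTP: {dntp_final_mM} mM")
    for name, conc in buffer_final_mM.items():
        print(f"{name}: {conc} mM")
```

Under these assumptions each dNTP ends up at 1 mM and the buffer at 1 X strength.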

(ii) Second Strand cDNA Synthesis

To the first strand synthesis reaction mixture the
following components were added: 55 µl of 2 X second
strand synthesis buffer (50 mM Tris HCl, pH=7.0; 60 mM
KCl); 2 µl RNase H; 5 µl DNA polymerase I, and 2 µl of the
above described nucleotide solution. The reaction was
incubated for one hour at 12°C, followed by a one hour
incubation at room temperature. The reaction mixture was
extracted with an equal volume of 1:1 phenol/chloroform,
followed by an extraction using 24:1 chloroform/isoamyl
alcohol. To each reaction mixture 1 µl of 10 mg/ml tRNA
was added as carrier. The cDNA was precipitated by the
addition of two volumes of 100% ethanol and chilling at
-70°C for 15 minutes. The cDNA was collected by
centrifugation, the pellet washed with 70% ethanol and
dried under vacuum.

(iii) Preparation of the Double Stranded cDNA for Cloning

To provide vector compatible ends each of the double
stranded cDNA preparations was tailed with EcoRI linkers in
the following manner.

The cDNA was treated with EcoRI methylase under the
following conditions: The cDNA pellet was resuspended in
20 µl 1 X methylase buffer (50 mM Tris HCl, pH=7.5; 1 mM
EDTA; 5 mM DTT), 2 µl 0.1 mM S-adenosyl-methionine (SAM) and
2 µl EcoRI methylase (New England Biolabs). The reaction
was incubated for 30 minutes at 37°C. TE buffer (10 mM
Tris-HCl, pH=7.5; 1 mM EDTA, pH=8.0) was added to achieve
a final volume of 80 µl. The reaction mixture was extracted
with an equal volume of phenol/chloroform (1:1) and then
with an equal volume of chloroform/isoamyl alcohol (24:1).
The cDNA was precipitated with two volumes of ethanol.

To maximize the number of blunt ends for the addition
of linkers (Maniatis et al., 1982) the cDNA was then treated
with the Klenow fragment of DNA polymerase I. The pelleted
cDNA was resuspended in 11.5 µl of distilled water. The
following components were added to the resuspended cDNA:
4 µl of 5 X NTB (10 X NTB stock solution: 0.5 M Tris.Cl
pH=7.2; 0.1 M MgSO4; 1 mM dithiothreitol (DTT); 500 µg/ml
bovine serum albumin (BSA)); 3 µl 0.1 M MgCl2, 1.5 µl 10GATC
(a solution containing 10 mM of each nucleotide G, A, T,
and C), and 1 µl Klenow (Boehringer Mannheim Biochemicals).
The reaction mixture was incubated at room temperature for
30 minutes. The reaction mixture was extracted with
phenol/chloroform and chloroform/isoamyl alcohol as
described above, and then precipitated with two volumes of
ethanol.

The cDNA pellet was resuspended in 12 µl distilled
water. To the resuspended cDNA the following components
were added: 5 µl EcoRI phosphorylated linkers (New England
Biolabs), 2 µl 10 X ligation buffer (0.66 M Tris.Cl pH=7.6,
50 mM MgCl2, 50 mM DTT, 10 mM ATP) and 1 µl T4 DNA ligase.
The reaction was incubated at 14°C overnight. The following
morning the reaction was incubated at 67°C for three
minutes to inactivate the ligase, then momentarily chilled.
To the ligation reaction mixture 2.5 µl of 10 X high salt
restriction digest buffer (Maniatis et al.) and 2.5 µl of
EcoRI enzyme were added and the mixture incubated at 37°C
for at least 6 hours to overnight. To remove excess
linkers the digestion mixture was loaded onto a 1.2%
agarose gel and the reaction components size fractionated
by electrophoresis. Size fractions of the 0.3-1.3 Kb and
1.3-7 Kb ranges were electroeluted onto NA45 paper
(Schleicher and Schuell). The NA45 paper, with the eluted
cDNA bound to it, was placed in a 1.5 ml microfuge tube
containing 0.5 ml of elution solution (50 mM arginine, 1 M
NaCl, pH=9.0). The tube was then placed at 67°C for
approximately one hour to allow the cDNA to be eluted from
the paper into the solution. The solution was then
phenol/chloroform, chloroform/isoamyl alcohol extracted and
precipitated with two volumes of ethanol. The resulting
cDNA pellets were resuspended in 20 µl TE (pH=7.5).

(iv) Cloning of the cDNA into Lambda Vectors

The linkers used in the construction of the cDNAs
contained an EcoRI site which allowed for direct insertion
of the amplified cDNAs into lambda gt10 and gt11 vectors
(Promega, Madison WI). The lambda vectors were purchased
from the manufacturer (Promega) already digested with
EcoRI and treated with bacterial alkaline phosphatase, to
remove the 5' phosphate and prevent self-ligation of the
vector.

The EcoRI-linkered cDNA preparations were ligated into
both lambda gt10 and gt11 (Promega). The conditions of the
ligation reactions were as follows: 1 µl vector DNA
(Promega, 0.5 mg/ml); 0.5 or 3 µl of insert cDNA; 0.5 µl 10
X ligation buffer (0.5 M Tris-HCl, pH=7.8; 0.1 M MgCl2; 0.2
M DTT; 10 mM ATP; 0.5 g/ml BSA), 0.5 µl T4 DNA ligase (New
England Biolabs) and distilled water to a final reaction
volume of 5 µl.

The ligation reaction tubes were placed at 14°C
overnight (12-18 hours). The ligated cDNA was packaged the
following morning by standard procedures using a lambda DNA
packaging system (GIGAPAK, Stratagene, La Jolla, CA), and
then plated at various dilutions to determine the titer and
recombinant frequency of the libraries. A standard X-gal
blue/white assay was used to screen the lambda gt11
libraries (Miller; Maniatis et al.). E. coli HG415 (from
Howard Gersenfeld, Dept. of Pathology, Stanford School of
Medicine) plating bacteria, which allow plaque formation
only by recombinant clones, were used for plating the
lambda gt10 libraries. The standard strain, E. coli C600
hfl, may be used as an alternative to E. coli HG415.

Example 2

Screening the cDNA Library for Production of PT-NANBH
Antigens

The five lambda gt11 libraries generated in Example 1
were screened for specific HCV encoded viral antigens by
immunoscreening. The phage were plated for plaque
formation using the Escherichia coli bacterial plating
strain E. coli KM392 (Kevin Moore, DNAX, Palo Alto, CA).
Alternatively, E. coli Y1088 may be used. The fusion
proteins expressed by the lambda gt11 clones were screened
with serum antibodies (Young et al.) from the following
sources: chimpanzee #771 and various human PT-NANBH sera
(including EGM, BV, WEH and AG).

From the lambda gt11 libraries (Example 1)
approximately 111 independent clones gave a positive
immunological reaction with at least one of the chimp or
human PT-NANBH sera. These phage clones were plaque
purified and the recombinant phage grown for DNA
purification (Maniatis et al.).

Example 3

Genomic Hybridization Screening

Out of the 111 plaque purified recombinant phage,
obtained as in Example 2, 93 were isolated (Maniatis et
al.) and digested with EcoRI as per the manufacturer's
instructions (Bethesda Research Laboratories, Gaithersburg,
MD). Approximately 1.0 microgram of each digested phage
DNA sample was loaded into sample wells of 1.0% agarose
gels prepared using TAE (0.04 M Tris Acetate, 0.001 M
EDTA). The DNA samples were then electrophoretically
separated. DNA bands were visualized by ethidium bromide
staining (Maniatis et al.). Inserts were clearly
identified for each of the 93 clones, purified by
electroelution using NA45, and then radioactively labelled
by nick translation (Maniatis et al.).
Human peripheral blood lymphocyte (PBL) DNA was
restriction digested with HindIII and EcoRI, loaded on a
0.7% agarose gel (as above, except 10 µg of DNA was loaded
per lane) and the fragments separated electrophoretically.
The DNA fragments in the agarose gels were transferred to
nitrocellulose filters (Southern) and the genomic DNA
probed with the nick-translated lambda gt11 inserts which
were prepared above.

The filters were washed (Southern; Maniatis et al.)
and exposed to X-ray film. Forty-three of the 93 lambda
clone inserts displayed a positive hybridization reaction
with the human PBL DNA. Among the remaining inserts, which
clearly did not hybridize with the PBL DNA, were 11 inserts
derived from chimp #771 clones which were also clearly
immunopositive in Example 2. Of these 11 clones, two had
the immunoreactive characteristics summarized in Table 1.
Chimpanzee #771 and humans AG, BV and WEH were chronic
PT-NANBH serum samples and SKF was a normal human serum
sample.



Table 1 

Sera Clone Designation

36 40 



+ 



#771 

AG + 
BV + 
WEH 



SKF

Clone 40 (original clone screening designation 304-12-1)
was clearly exogenous, i.e., not derived from normal human 
DNA, as evidenced by repeated hybridization tests against 
normal human PBL DNA, and a second clone, designated clone 
36 (original clone screening designation 303-1-4) , was not 
only exogenous but also reactive with multiple PT-NANBH 
antisera. 



Example 4

Sequencing of Clones

DNA sequencing was performed on clones 36 and 40 as
described in Example 3. Commercially available sequencing
primers (New England Biolabs) homologous to flanking lambda
sequences at the 5' and 3' ends of the inserts were
initially used for sequencing. As sequencing progressed
primers were constructed to correspond to newly discovered
sequences. Synthetic oligonucleotide primers were prepared
using commercially available automated oligonucleotide
synthesizers. Alternatively, custom designed synthetic
oligonucleotides may be purchased, for example, from
Synthetic Genetics (San Diego, CA).

DNA sequences were determined for the complete insert
of clone 40 (presented as SEQ ID NO:1 and also shown in
Figure 3); this sequence corresponds to nucleotides 6516 to
7070 of the HCV genome (Appendix). Subsequently, the
inserts present in clones 44 and 45 (2 other clones of the
11 clones identified in Example 3) were found to
cross-hybridize to the clone 40 insert. Partial sequencing
of clones 44 and 45 showed that the sequences obtained from
these two clones matched the sequence of clone 40. A
partial sequence of the clone 36 insert was determined and
is presented as SEQ ID NO:3; the complete sequence is
presented as SEQ ID NO:5 and is also shown in Figure 4. The
sequence of clone 36 corresponds to nucleotides 5010 to
6515 given in the Appendix.



Example 5

Screening of the cDNA Library in Lambda gt10

The cDNA libraries in lambda gt10, generated in
Example 1, were screened for the presence of sequences
homologous to the clone 40 insert.

The lambda gt10 libraries were plated at a density of
approximately 10^4 plaques/plate and plaque lifts were
prepared according to Maniatis et al. Filters were indexed
using india ink to allow alignment of the filters with the
parent plate from which the plaque lift was performed. The
bacteria and phage particles were lysed, and the
nitrocellulose filters were processed and baked as
previously described (Maniatis et al.). The
prehybridization solution, per filter, consisted of the
following: 5.4 ml prehybridization buffer (50 ml of 1 M
Tris HCl, pH=7.5; 2 ml of 0.5 M EDTA, pH=8.0; 50 ml of 10%
SDS; 150 ml of 20 X SSC (Maniatis et al.); and 238 ml of
glass distilled water); 6.0 ml formamide; 0.4 ml 50 X
Denhardt's solution (5 g FICOLL; 5 g polyvinylpyrrolidone;
5 g bovine serum albumin; brought to a total volume of 500
ml with glass distilled water); and 0.2 ml of
single-stranded salmon sperm DNA (10 mg/ml). Each filter
was placed in a plastic bag and the prehybridization
solution was added. The bag was sealed and incubated at
37°C overnight with intermittent mixing of contents.
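The working concentrations in the per-filter prehybridization solution follow from the stock recipes above. The sketch below is an illustrative calculation only; it assumes that volumes are simply additive and that the stock buffer totals the sum of its listed components, and the variable names are hypothetical.

```python
# Illustrative dilution arithmetic for the prehybridization solution.
# All volumes are in milliliters and are taken from the text.
mix_ml = {
    "prehybridization buffer": 5.4,
    "formamide": 6.0,
    "50 X Denhardt's": 0.4,
    "salmon sperm DNA (10 mg/ml)": 0.2,
}
buffer_ml = {
    "1 M Tris HCl": 50.0,
    "0.5 M EDTA": 2.0,
    "10% SDS": 50.0,
    "20 X SSC": 150.0,
    "water": 238.0,
}

total_ml = sum(mix_ml.values())            # per-filter volume
buffer_total_ml = sum(buffer_ml.values())  # stock buffer batch volume
buffer_fraction = mix_ml["prehybridization buffer"] / total_ml

# Final strengths: stock strength x fraction of stock in the buffer
# x fraction of buffer in the per-filter mix.
formamide_pct = 100.0 * mix_ml["formamide"] / total_ml
ssc_x = 20.0 * buffer_ml["20 X SSC"] / buffer_total_ml * buffer_fraction
sds_pct = 10.0 * buffer_ml["10% SDS"] / buffer_total_ml * buffer_fraction
denhardt_x = 50.0 * mix_ml["50 X Denhardt's"] / total_ml
carrier_dna_mg_per_ml = 10.0 * mix_ml["salmon sperm DNA (10 mg/ml)"] / total_ml

if __name__ == "__main__":
    print(f"total volume per filter: {total_ml:.1f} ml")
    print(f"formamide: {formamide_pct:.0f}% (v/v)")
    print(f"SSC: {ssc_x:.2f} X, SDS: {sds_pct:.2f}%")
    print(f"Denhardt's: {denhardt_x:.2f} X")
    print(f"carrier DNA: {carrier_dna_mg_per_ml:.3f} mg/ml")
```

Under these assumptions each filter receives about 12 ml of solution containing 50% formamide.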

The clone 40 lambda DNA was isolated (Maniatis et al.)
and digested with EcoRI. The resulting fragments were
fractionated on an agarose gel and visualized by ethidium
bromide staining (Maniatis et al.). The DNA fragment
corresponding to the clone 40 insert, approximately 500
base pairs, was isolated from the agarose by electroelution
onto NA45. The aqueous suspension of the purified fragment
was extracted once with a 1:1 phenol/chloroform solution,
and once with a 24:1 chloroform/isoamyl alcohol solution.
The DNA was then precipitated with ethanol and resuspended
in sterile water.

The clone 40 insert was radioactively labelled by nick
translation and used to probe the lambda gt10 plaque lift
filters. The prehybridization solution was removed from
the filters. Each filter was hybridized with probe under
the following conditions: 5.0 ml of hybridization buffer
(5 ml of 1 M Tris HCl, pH=7.5; 0.2 ml of 0.5 M EDTA,
pH=8.0; 5.0 ml of 10% SDS; 14.9 ml of 20 X SSC (Maniatis et
al.); 10 g of dextran sulfate; and glass distilled water
to a total volume of 50 ml); 5.0 ml formamide; 0.4 ml 50 X
Denhardt's solution (5 g FICOLL; 5 g polyvinylpyrrolidone;
5 g bovine serum albumin; brought to a total volume of 500
ml with glass distilled water); and 0.2 ml of
single-stranded salmon sperm DNA (10 mg/ml). To this
hybridization mix was added 50-250 µl of denatured probe
(boiled 5-10 minutes and quick-chilled on ice), resulting
in approximately 10^6 cpm of labelled probe per filter. The
hybridization mix containing the labelled probe was then
added to the plastic bag containing the filters. The bag
was resealed and placed under a glass plate in a 37°C water
bath overnight with intermittent mixing of contents.

The next day the hybridization solution was removed
and the filters washed three times, for 5 minutes each, in
2 X SSC (Maniatis et al.) containing 0.5% SDS, at room
temperature. The filters were then washed for one hour in
2 X SSC, containing 0.5% SDS, at 50°C. The filters were
then washed for 15-60 minutes in 0.1 X SSC, containing 0.1%
SDS, at 50°C and finally two to three times, for 15 minutes
each, in 2 X SSC at room temperature. The washed filters
were dried and then exposed to X-ray film for detection of
positive plaques.

Approximately 24 plaques from the lambda gt10
libraries were plaque purified from the approximately 200
plaques which tested positive by the hybridization screen
(Table 2).



Table 2

Library     cDNA Source     Positives/Plate

EGM         Human           ≈50
BV          Human           ≈100
WEH         Human           ≤25
#771        Chimp           ≥10-15

Example 6

Analysis of Lambda gt10 cDNA Library Clones Homologous
to the Clone 40 Insert

The clones identified in Example 5 which have homology
to the clone 40 insert were analyzed by standard
restriction analysis and the insert sizes were determined.
The original frequencies of positive hybridization signals
per plate using the clone 40 insert as probe against the
different cDNA sources are shown in the last column of
Table 2. That these positive signals arose with different
frequencies for the different cDNA sources in the lambda
gt10 library suggests that the hybridization signals
originated from the sera source rather than common
contamination introduced during cDNA synthesis or cloning.

One of the clones (108-2-5) from the EGM-generated
cDNA library identified by hybridization with the clone 40
insert had an insert of approximately 3.7 kb and was
chosen for further analysis. The insert was isolated by
EcoRI digestion of the clone, electrophoretic
fractionation, and electroelution (Example 5). The insert
was treated with DNase I under conditions resulting in
partial digestion (Maniatis et al.) to generate random
fragments. The resulting fragments were inserted into
lambda gt11 vectors for expression. The lambda gt11 clones
were then immunoscreened (Example 2) using human (BV and
normal) and chimpanzee #771 sera. Twelve positive clones
were identified by first round immunoscreening with the
human and chimp sera. Seven of the 12 clones were plaque
purified and rescreened using chimp serum (#771). Partial
DNA sequences of the insert DNA were determined for two of
the resulting clones that had the largest sequences,
designated 328-16-1 and 328-16-2. The 2 clones had
sequences essentially identical to clone 40.

Example 7

Preparing Amplified HCV cDNA Fragments

A. Preparing cDNA fragments 

A plasma pool from a chimpanzee with chronic
PT-NANBH was obtained from the Centers for Disease Control
(CDC) (Atlanta, GA). After direct pelleting or PEG
precipitation, RNA was extracted from the virions by
guanidinium thiocyanate-phenol-chloroform extraction,
according to published methods (Chomczynski). The pelleted
RNA was used for cDNA synthesis using oligo dT or random
primers, or HCV sequence-specific primers and a commercial
cDNA kit (Boehringer-Mannheim).

In one method, synthesis of first strand cDNA was
achieved by addition of four primers, designated A, B, C,
and D, having the sequences shown below. These sequences
are complementary to the HCV genomic regions indicated:

A: 5'-GCGGAAGCAATCAGTGGGGC-3', complementary to basepairs
394-413;

B: 5'-GCCGGTCATGAGGGCATCGG-3', complementary to basepairs
2960-2980;

C: 5'-CGAGGAGCTGGCCACAGAGG-3', complementary to basepairs
5239-5258; and

D: 5'-TGGTTCTATGGAGTAGCAGGCCCCG-3', complementary to
basepairs 7256-7280.
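A quick consistency check on the four primers can be sketched as follows. This is an illustrative aid rather than part of the described method: it derives the genomic-sense sequence each primer would anneal to (the reverse complement) and compares each primer length with the length of its stated basepair range. The helper names are hypothetical, and the sketch only flags a mismatch without resolving it.

```python
# Primer sequences (written 5'->3') and stated target ranges, from the text.
PRIMERS = {
    "A": ("GCGGAAGCAATCAGTGGGGC", 394, 413),
    "B": ("GCCGGTCATGAGGGCATCGG", 2960, 2980),
    "C": ("CGAGGAGCTGGCCACAGAGG", 5239, 5258),
    "D": ("TGGTTCTATGGAGTAGCAGGCCCCG", 7256, 7280),
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def span_length(start, end):
    """Number of positions in an inclusive basepair range."""
    return end - start + 1


# Genomic-sense sequence that each antisense primer would pair with.
target_sites = {name: reverse_complement(seq)
                for name, (seq, _, _) in PRIMERS.items()}

# Primers whose length differs from the length of the stated range.
length_mismatches = [name for name, (seq, start, end) in PRIMERS.items()
                     if len(seq) != span_length(start, end)]

if __name__ == "__main__":
    for name, (seq, start, end) in PRIMERS.items():
        print(name, len(seq), "nt; stated range covers",
              span_length(start, end), "positions; target",
              target_sites[name])
    print("length mismatches:", length_mismatches)
```

As written in the text, the range given for primer B covers 21 positions while the listed sequence has 20 nucleotides; the other three primers agree with their ranges.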

Second strand cDNA synthesis was performed by the
method of Gubler and Hoffman. The reactions were carried
out under the standard cDNA synthesis conditions given in
the commercial kit.



B. Amplifying the cDNA Fragments 

The cDNA from above was blunt ended and ligated to the
linker/primer having the following sequence:

Linker/primer:  5'-GGA ATT CGC GGC CGC TCG-3'    A-strand
             3'-TT CCT TAA GCG CCG GCG AGC-5'    B-strand
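The pairing of the two linker strands can be verified with a short sketch. It is illustrative only and uses hypothetical names; it checks that the B-strand is the reverse complement of the A-strand plus a two-nucleotide extension, and that the A-strand contains the EcoRI (GAATTC) and NotI (GCGGCCGC) recognition sequences.

```python
A_STRAND = "GGAATTCGCGGCCGCTCG"          # written 5'->3', as in the text
B_STRAND_3TO5 = "TTCCTTAAGCGCCGGCGAGC"   # written 3'->5', as in the text

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# Re-express the B-strand 5'->3' so it can be compared directly.
b_strand_5to3 = B_STRAND_3TO5[::-1]

# The part of the B-strand that base-pairs with the A-strand.
duplex_part = reverse_complement(A_STRAND)
strands_pair = b_strand_5to3.startswith(duplex_part)

# Whatever remains of the B-strand is unpaired (its 3' end).
overhang = b_strand_5to3[len(duplex_part):]

has_ecori_site = "GAATTC" in A_STRAND
has_noti_site = "GCGGCCGC" in A_STRAND

if __name__ == "__main__":
    print("strands pair:", strands_pair)
    print("unpaired 3' extension on B-strand:", overhang)
    print("EcoRI site present:", has_ecori_site)
    print("NotI site present:", has_noti_site)
```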



The cDNA and linker were mixed at a 1:100 molar ratio in
the presence of 0.3 to 0.6 Weiss units of T4 DNA ligase.
To 100 µl of 10 mM Tris-Cl buffer, pH 8.3, containing 1.5
mM MgCl2 and 50 mM KCl (Buffer A) was added about 1 x 10^-3
µg of the linker-ended cDNA, 2 µM of linker/primer A
(A-strand) having the sequence d(5'-GGAATTCGCGGCCGCTCG-3'),
200 µM each of dATP, dCTP, dGTP, and dTTP, and 2.5 units of
Thermus aquaticus DNA polymerase (Taq polymerase). The
reaction mixture was heated to 94°C for 30 sec for
denaturation, allowed to cool to 50°C for 30 sec for primer
annealing, and then heated to 72°C for 0.5-3 minutes to
allow for primer extension by Taq polymerase. The
replication reaction, involving successive heating,
cooling, and polymerase reaction, was repeated an
additional 25 times with the aid of a Perkin-Elmer Cetus
DNA thermal cycler. This results in a pool of SISPA
(sequence-independent single primer amplification)-
amplified DNA fragments.
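The cycling program above can be summarized numerically. The sketch below is illustrative and rests on stated assumptions: it counts one initial cycle plus the 25 repeats, considers hold times only (temperature ramping is ignored), and treats perfect doubling per cycle as a theoretical upper bound that real reactions do not reach. The names are hypothetical.

```python
# One initial cycle plus the 25 additional repeats described in the text.
CYCLES = 1 + 25

# (step, minimum hold in seconds, maximum hold in seconds)
STEPS_S = [
    ("denaturation at 94 C", 30, 30),
    ("annealing at 50 C", 30, 30),
    ("extension at 72 C", 30, 180),  # 0.5-3 minutes
]

min_cycle_s = sum(lo for _, lo, _ in STEPS_S)
max_cycle_s = sum(hi for _, _, hi in STEPS_S)

# Total hold time for the whole program, excluding ramp times.
total_min_minutes = CYCLES * min_cycle_s / 60
total_max_minutes = CYCLES * max_cycle_s / 60

# Upper bound on amplification if every template doubled in every cycle.
ideal_fold = 2 ** CYCLES

if __name__ == "__main__":
    print(f"cycles: {CYCLES}")
    print(f"hold time: {total_min_minutes:.0f}-{total_max_minutes:.0f} min")
    print(f"ideal amplification bound: {ideal_fold:.2e}-fold")
```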

Example 8

Preparing Primer-Pair Fragments

Amplified cDNA fragments from Example 7 were mixed
with 100 µl Buffer A, 1 µM of equal molar amounts of one of
the primer pairs given below, 200 µM each of dATP, dCTP,
dGTP, and dTTP, and 2.5 units of Thermus aquaticus DNA
polymerase (Taq polymerase). Each primer pair includes a
forward (upstream) primer Fi which is identical to the
coding strand at the upstream end of an overlap region Pi of
duplex genomic DNA and a reverse primer Ri which is
complementary to the coding strand at the downstream end of
the region Pi. The sets of primers each define an overlap
region of about 200 basepairs, and the spacing between
adjacent overlapping primer regions (i.e., between adjacent
Fi/Ri pairs) is about 0.5-1 kilobase. The regions of HCV
which are complementary to the primers are given below:

F1, basepairs 183-201; R1, basepairs 361-380
F10, basepairs 576-595; R10, basepairs 841-860
F2, basepairs 1080-1100; R2, basepairs 1254-1273
F3, basepairs 1929-1948; R3, basepairs 2067-2086
F4, basepairs 2754-2733; R4, basepairs 2920-2940
F5, basepairs 3601-3620; R5, basepairs 3745-3764
F6, basepairs 4301-4320; R6, basepairs 4423-4442
F12, basepairs 4715-4734; R12, basepairs 4847-4865
F7, basepairs 5047-5066; R7, basepairs 5200-5216
F8, basepairs 5885-5904; R8, basepairs 6028-6047
F9, basepairs 6902-6921; R9, basepairs 7051-7070

Polymerase Chain Reaction (PCR) amplification of the
SISPA-amplified cDNA fragments with each Fi/Ri primer pair
was carried out under conditions similar to those used
above, with about 25 cycles.

The amplified fragment mixtures from above were each
fractionated by electrophoresis on 1.5% agarose and
transferred to nitrocellulose filters (Southern).
Hybridization of the nitrocellulose-bound fragments, each
with an internal-sequence oligonucleotide probe, confirmed
that each fragment contained the expected sequences.
Hybridization was carried out with an internal
oligonucleotide radiolabeled by polynucleotide kinase,
according to standard methods.

Example 9 
Preparing Linking Fragments 

This example describes preparing large overlapping
linking fragments of the HCV sequence. SISPA-amplified
cDNA fragments from Example 7 were mixed with 100 µl Buffer
A, 1 µM of equal molar amounts of forward and reverse
primers in each of the primer pairs given below, 200 µM each
of dATP, dCTP, dGTP, and dTTP, and 2.5 units of Thermus
aquaticus DNA polymerase (Taq polymerase), as in Example 8.
Each primer pair includes a forward primer Fi and a reverse
primer Rj, where Fi is the forward primer for one overlap
region and Rj is the reverse primer of the adjacent
overlap region. Thus each linking fragment spans two
adjacent overlap regions. The sets of primers each define
a linking fragment of about 0.5-1 kilobases. The sequences
of the primer pairs are given in Example 8. The
overlapping linking fragments of the HCV sequence (Appendix)
spanned by each primer pair are given below:

F1/R10, basepairs 183-860
F10/R2, basepairs 576-1273
F2/R3, basepairs 1080-2086
F3/R4, basepairs 1929-2940
F4/R5, basepairs 2754-3762
F5/R6, basepairs 3601-4442
F6/R12, basepairs 4301-4865
F12/R7, basepairs 4715-5216
F7/R8, basepairs 5047-6047
F8/R9, basepairs 5885-7070

Two-primer amplification of the SISPA-amplified cDNA
fragments with each Fi/Rj primer pair was carried out under
conditions similar to those described above, with about 25
cycles.
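The linking-fragment spans listed above can be recomputed from the primer regions of Example 8, since each fragment runs from the start of its forward primer to the end of its reverse primer. The sketch below is an illustrative cross-check with hypothetical names. It assumes that F12 lies upstream of R12, as implied by the linking-fragment list, and it only reports disagreements between the two listings without deciding which value is correct.

```python
# Forward-primer start and reverse-primer end positions (basepairs), from
# the primer regions in Example 8. Assumption: F12 starts at 4715 and R12
# ends at 4865, the orientation implied by the linking-fragment list.
F_START = {"F1": 183, "F10": 576, "F2": 1080, "F3": 1929, "F4": 2754,
           "F5": 3601, "F6": 4301, "F12": 4715, "F7": 5047, "F8": 5885}
R_END = {"R10": 860, "R2": 1273, "R3": 2086, "R4": 2940, "R5": 3764,
         "R6": 4442, "R12": 4865, "R7": 5216, "R8": 6047, "R9": 7070}

LINKING_PAIRS = [("F1", "R10"), ("F10", "R2"), ("F2", "R3"), ("F3", "R4"),
                 ("F4", "R5"), ("F5", "R6"), ("F6", "R12"), ("F12", "R7"),
                 ("F7", "R8"), ("F8", "R9")]

# Spans as listed for the linking fragments in this example.
LISTED_SPANS = {
    ("F1", "R10"): (183, 860), ("F10", "R2"): (576, 1273),
    ("F2", "R3"): (1080, 2086), ("F3", "R4"): (1929, 2940),
    ("F4", "R5"): (2754, 3762), ("F5", "R6"): (3601, 4442),
    ("F6", "R12"): (4301, 4865), ("F12", "R7"): (4715, 5216),
    ("F7", "R8"): (5047, 6047), ("F8", "R9"): (5885, 7070),
}

# Span computed from the primer coordinates: forward start to reverse end.
computed_spans = {(f, r): (F_START[f], R_END[r]) for f, r in LINKING_PAIRS}
lengths_bp = {pair: end - start + 1
              for pair, (start, end) in computed_spans.items()}

# Pairs where the computed span differs from the listed span.
discrepancies = [pair for pair in LINKING_PAIRS
                 if computed_spans[pair] != LISTED_SPANS[pair]]

# Each fragment should begin before the previous one ends.
adjacent_overlap = all(
    computed_spans[nxt][0] <= computed_spans[prev][1]
    for prev, nxt in zip(LINKING_PAIRS, LINKING_PAIRS[1:]))

if __name__ == "__main__":
    for pair in LINKING_PAIRS:
        print("/".join(pair), computed_spans[pair], lengths_bp[pair], "bp")
    print("discrepancies:", discrepancies)
    print("adjacent fragments overlap:", adjacent_overlap)
```

With these inputs the only disagreement is for F4/R5, where the listed end (3762) differs from the end of the R5 region (3764).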

The amplified fragment mixtures from above were each
fractionated by electrophoresis on 1.2% agarose, and
transferred to nitrocellulose filters (Southern) for
hybridization with radiolabeled internal oligonucleotide
probes as above. The analysis confirmed that each linking
fragment contained the two end-primer sequences from
adjacent overlap regions. The sequences contained in each
of the linking fragments are indicated in the Appendix.

Example 10

Preparing Cloned Peptide Fragments

A. DNA Fragment Digestion

Each of the ten linking fragments from Example 9 was suspended in a standard digest buffer (0.5 M Tris HCl, pH 7.5; 1 mg/ml BSA; 10 mM MnCl2) to a concentration of about 1 mg/ml and digested with DNase I at room temperature for various times (1-5 minutes). These reaction conditions were determined from a prior calibration study, in which the incubation time required to produce predominantly 100-300 basepair fragments was determined. The material was extracted with phenol/chloroform before ethanol precipitation.

The fragments in the digest mixture were blunt-ended and ligated with EcoRI linkers. The resultant fragments were analyzed by electrophoresis (5-10 V/cm) on 1.2% agarose gels, using PhiX174/HaeIII and lambda/HindIII size markers. The 100-300 bp fraction was eluted onto NA45 strips (Schleicher and Schuell), which were then placed into 1.5 ml microtubes with eluting solution (1 M NaCl, 50 mM arginine, pH 9.0), and incubated at 67°C for 30-60 minutes. The eluted DNA was phenol/chloroform extracted and then precipitated with two volumes of ethanol. The pellet was resuspended in 20 µl TE buffer (0.01 M Tris HCl, pH 7.5, 0.001 M EDTA).



B. Cloning the Digest Fragments 

Lambda gt11 phage vector (Young et al.) was obtained from Promega Biotec (Madison, WI). This cloning vector has a unique EcoRI cloning site 53 base pairs upstream from the beta-galactosidase translation termination codon. The partial digest fragments from each linking fragment in Part A were introduced into the EcoRI site by mixing 0.5-1.0 µg EcoRI-cleaved lambda gt11, 0.3-3 µl of the above sized fragments, 0.5 µl 10X ligation buffer (above), 0.5 µl DNA ligase (200 units), and distilled water to 5 µl. The mixture was incubated overnight at 14°C, followed by in vitro packaging, according to standard methods (Maniatis, pp. 256-268).

The packaged phage were used to infect E. coli strain KM392, obtained from Dr. Kevin Moore, DNAX (Palo Alto, CA). Alternatively, E. coli strain Y1090, available from the American Type Culture Collection (ATCC No. 37197), could be used. A lawn of KM392 cells infected with about 10^3-10^4 pfu of the phage stock from above was prepared on a 150 mm plate and incubated, inverted, for 5-16 hours at 37°C. The infected bacteria were checked for loss of beta-galactosidase activity (clear plaques) in the presence of X-gal using a standard X-gal substrate plaque assay method (Maniatis).

Identification of single plaques containing a digest-fragment insert was confirmed as follows. Clear single plaques (containing the progeny of a single phage) were removed from the plate and suspended in extraction buffer (Maniatis) to release the phage DNA. The phage extract was added to the above DNA amplification mixture in the presence of primers which are about 70 basepairs away in either direction from the EcoRI site of lambda gt11. Thus phage containing a digest-fragment insert will yield an amplified digest fragment of about 140 basepairs plus insert. Phage DNA amplification was carried out as described above, with 25 cycles of amplification. The reaction material from each plaque tested was fractionated on 1.5% agarose, and examined for the size of amplified digest fragments. Non-recombinant phage gave a 140 basepair band, and recombinant phage, a band which is 140 basepairs plus the insert sequence in size. The results are shown in column 2 (REC Freq) of Table 3 below, for the six linking-fragment libraries indicated in the first column of Table 3. The denominator in the column-2 entries is the total number of plaques assayed by primer amplification. The numerator is the number of clear plaques containing fragment inserts. Thus, 3/15 means that 3 plaques tested positive by PCR out of a total of 15 clear plaques assayed.
Table 3

Library (1)    REC Freq (2)    1° Screen (3)    PA/REC (4)
F2R3 #2        3/15            2                0.33
F3R4 #1        7/12            0                -
F4R5 #3        9/10            10               0.37
F5R6 #5        11/12           37               1.35
F7R8 #7        0/12            1                -
F8R9 #10       3/12            58               7.73

1 - Libraries constructed by partial DNase I digestion of indicated
2 - Recombinant % frequency determined by PCR with insert flanking lambda
3 - Primary screening with chronic human PT-NANBH serum (1:100) on 1.5X10
4 - PA/REC indicates the number of positive areas detected per actual number of recombinants plated

The library of digest fragments constructed for each linking fragment was screened for expression of peptides which are immunoreactive with a human PT-NANBH serum. The lawn of phage-infected bacteria was overlaid with a nitrocellulose sheet, transferring PT-NANBH recombinant peptides from the plaques to filter paper. The plate and filter were indexed for matching corresponding plate and filter positions.

The filter was removed after 6-12 hours, washed three times in TBS buffer (10 mM Tris, pH 8.0, 150 mM NaCl), blocked with AIB (TBS buffer with 1% gelatin), washed again in TBS, and incubated overnight with antiserum (diluted to 1:100 in AIB, 12-15 ml/plate). The sheet was washed twice in TBS and then incubated with alkaline-phosphatase-conjugated anti-human IgG to attach the labeled antibody at filter sites containing antigen recognized by the antiserum. After a final washing, the filter was developed in a substrate medium containing 33 µl NBT (50 mg/ml stock solution maintained at 4°C) mixed with 16 µl BCIP (50 mg/ml stock solution maintained at 4°C) in 5 ml of alkaline phosphatase buffer (100 mM Tris, pH 9.5, 100 mM NaCl, 5 mM MgCl2). Reacted substrate precipitated at points of antigen production, as recognized by the antiserum.

The total number of plaques which showed antigen-positive reaction (positive areas, PA) in the primary screen is given in the third column in Table 3. The fourth column in the table is the frequency of positive areas per total number of recombinant phage screened (x 10^3). This last column is therefore a measure of the relative immunogenicity of antigen expressed from a particular linking fragment using this particular serum sample.
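
The PA/REC normalization described above can be sketched in Python as follows. The positive-area counts and recombinant frequencies are the Table 3 entries (2 with 3/15, 10 with 9/10, 37 with 11/12, and 58 with 3/12). The total number of plaques screened per library is not legible in the text, so PLAQUES_SCREENED below is purely an assumption, chosen because a value of about 3 x 10^4 reproduces the tabulated PA/REC figures. The function and variable names are likewise illustrative.

```python
# ASSUMPTION: total plaques screened per library. This figure is not stated
# legibly in the text; 3 x 10^4 is used because it reproduces Table 3.
PLAQUES_SCREENED = 3.0e4


def pa_per_rec(positive_areas, rec_positive, rec_assayed, plaques_screened):
    """Positive areas per recombinant phage screened, scaled by 10^3."""
    recombinants = plaques_screened * rec_positive / rec_assayed
    return 1000.0 * positive_areas / recombinants


# (positive areas, PCR-positive plaques, plaques assayed) from Table 3.
TABLE3 = {
    "F2R3": (2, 3, 15),
    "F4R5": (10, 9, 10),
    "F5R6": (37, 11, 12),
    "F8R9": (58, 3, 12),
}

for library, (pa, pos, assayed) in TABLE3.items():
    value = pa_per_rec(pa, pos, assayed, PLAQUES_SCREENED)
    print(f"{library}: PA/REC (x 10^3) = {value:.2f}")
```

Libraries with no detected recombinants (0/12) or no positive areas are omitted from the sketch, matching the dashes in the table.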

Example 11

Screening Digest Fragments
The digest-fragment libraries of each of the ten linking fragments from Example 9 were screened with sera from a human patient with chronic PT-NANBH and with pooled sera from chimpanzees with acute PT-NANBH infection and chronic PT-NANBH infection. Individual chronic and acute chimpanzee sera from 5 chimpanzees were obtained from the Centers for Disease Control.

The digest-fragment libraries from the linking fragments indicated in Table 4 below were screened with each of the three sera, using the screening procedure described in Example 10. The total number of positive areas observed in each plate (making up one fragment library) is given in the table. The entries in the table which are not in parentheses represent the number of positive areas which were confirmed by plaque purification, i.e., by replating plaques from the positive areas at low dilution and confirming a positive area (secondary screen). Typically about 90-95 percent of the positive areas in the primary screen tested positive by secondary screening. The entries in parentheses indicate positive areas which have not been confirmed in a secondary screen.

As seen from Table 4, all but one of the linking fragment libraries contained sequences encoding peptide antigens which are immunoreactive with either chronic human or chimpanzee infected sera. Five of the libraries contain sequences encoding antigens which are immunoreactive with acute sera, indicating that one or more of the antigens in this group are effective to detect acute-infection serum. Three of these latter libraries (F3/R4, F12/R7, and F7/R8) gave over 10 positives in each library. These data are not corrected for the recombinant frequency in a particular library and therefore do not reflect the comparative immunogenicity of the various linking fragments.
Table 4

           Human          Acute Pool     Chronic Pool
           P.P. Clones    P.P. Clones    P.P. Clones
F1R10      0              0              0
F10R2      4              2              4
F2R3       4              0              1
F3R4       0              10             10
F4R5       5              0              7
F5R6       34             0              (42)
F6R12      (400)          5              10(200)
F12R7      2              17(200)        9(200)
F7R8       0              20             10(130)
F8R9       60             0              1

( ) = not plaque purified
P.P. = Plaque Pure
Acute Pool = CDC Panel of Chimps
Chronic Pool = CDC Panel of Chimps



Example 12

Immunoscreening for 409-1-1 Antigen

A. Plaque Immunoscreening

Several clear plaques identified in the primary screen of the F4/R5 linking fragment were replated and plaque purified. One of the purified plaques was designated gt11/409-1-1(c-a). The digest fragment contained in clone 409-1-1(c-a) corresponds to two sets of base pairs present in the HCV genome and present in clone 409-1-1(abc). For ease of reference three regions (a, b, and c) have been designated in the 409-1-1(abc) clone (see below and Figure 5). The longest homology of base pairs corresponds approximately to nucleotides 2754 to 3129 of the Appendix (the "a" region, see Figure 5, region delineated by boxes) and the shorter homology corresponds approximately to nucleotides 3242 to 3311 of the Appendix (the "c" region, see Figure 5); normally the "c" region is located approximately 112 nucleotides distal to the 3' end of the "a" region (see Figure 5). The complete sequence of the gt11/409-1-1(c-a) insert is given in Figure 6 and presented as SEQ ID NO:7. This clone arose through a ligation event between two independent DNase I fragments generated from the F4/R5 linking clone and has ATCC No. 40792. A related clone, designated 409-1-1(abc), has been described in co-owned patent application Ser. No. 505,611 and has ATCC No. 40876.
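
The coordinates quoted above can be checked with a short Python sketch. It takes the approximate "a" and "c" region boundaries from the text, assumes inclusive endpoints, and computes the size of each region and the spacing between them in the native HCV sequence. The names A_REGION, C_REGION, span, and gap are illustrative only.

```python
# Approximate region boundaries (Appendix numbering) as quoted in the text.
A_REGION = (2754, 3129)  # the "a" region
C_REGION = (3242, 3311)  # the "c" region


def span(region):
    """Region length in nucleotides, assuming inclusive endpoints."""
    start, end = region
    return end - start + 1


def gap(upstream, downstream):
    """Nucleotides lying strictly between two non-overlapping regions."""
    return downstream[0] - upstream[1] - 1


print("a region:", span(A_REGION), "nt")
print("c region:", span(C_REGION), "nt")
print("spacing between a and c in the genome:", gap(A_REGION, C_REGION), "nt")
```

The computed spacing agrees with the figure of approximately 112 nucleotides given in the text, and it is this intervening stretch that is absent from the (c-a) clone.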

A lambda gt11 clone corresponding to the immunoreactive sequence reported in the EPO application 88310922.5, and designated 5-1-1, was prepared by primer-specific amplification of the amplified cDNA fragments generated in Example 7. The 5-1-1 sequence corresponds to basepairs 3730-3858 of the HCV sequence (Appendix), in the linking fragment F5/R6. The primers used for fragment amplification are 20 basepair oligomers complementary to the forward and reverse sequences of the 3732-3857 basepair 5-1-1 sequence. Both oligomers have EcoRI sites incorporated into their ends and the forward oligomer is designed to ensure a contiguous open reading frame with the beta-galactosidase gene. The amplified 5-1-1 sequence was purified by agarose gel electrophoresis, and cloned into lambda gt11 phage. Amplification and cloning methods were as described above. Phage containing the 5-1-1 sequence were identified and purified by primary and secondary screening, respectively, with human PT-NANBH serum, also as described above.
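
The assignment of a sequence to a linking fragment can be illustrated with the following Python sketch, which looks up which of the Example 9 linking fragments wholly contain a given coordinate range. The fragment coordinates are those listed in Example 9. The function name fragments_containing and the inclusive-endpoint convention are assumptions for illustration.

```python
# Linking-fragment coordinates (Appendix numbering) from Example 9.
LINKING_FRAGMENTS = {
    "F1/R10": (183, 860),
    "F10/R2": (576, 1273),
    "F2/R3": (1080, 2086),
    "F3/R4": (1929, 2940),
    "F4/R5": (2754, 3762),
    "F5/R6": (3601, 4442),
    "F6/R12": (4301, 4865),
    "F12/R7": (4715, 5216),
    "F7/R8": (5047, 6047),
    "F8/R9": (5885, 7070),
}


def fragments_containing(start, end):
    """Names of linking fragments that wholly contain basepairs start..end."""
    return [
        name
        for name, (frag_start, frag_end) in LINKING_FRAGMENTS.items()
        if frag_start <= start and end <= frag_end
    ]


print("5-1-1 (3730-3858):", fragments_containing(3730, 3858))
print('409-1-1 "a" region (2754-3129):', fragments_containing(2754, 3129))
```

With these coordinates the 5-1-1 sequence falls entirely within F5/R6 and the "a" region entirely within F4/R5, consistent with the fragments named in the text.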

The purified gt11/409-1-1(c-a) and gt11/5-1-1 clones were each mixed with negative lambda gt11 phage, plated and immunoscreened with a number of different donor sera from normal and NANBH-infected humans and chimpanzees, as indicated in Table 5 below. Each plate was divided into several equal-area sections, and the corresponding sections on the nitrocellulose transfer filter were separately screened with the donor sera indicated, using the immunoscreening method described in Example 11. The number of positives detected for each group of sera by the 5-1-1 and 409-1-1(c-a) peptides are shown, as well as a comparison with the C-100 test in the ELISA format, in Table 5.

Table 5

                                    # Positive
Source   Diagnosis   # Donors   5-1-1   409-1-1(c-a)   C-100
Human    Normal      2          0       0              NT
Human    NANB        6          4       5              0/1*
Chimp    Normal      7          0       0              0/5
Chimp    Acute       5          1       3              0/5
Chimp    Chronic     8          7       7              5/5

NT, not tested; * only BV serum was tested; N/5 means N positives out of five sera tested



B. Western Blot Screening 

For Western blot screening, gt11/409-1-1(c-a) phage from Example 11 was used to infect E. coli BNN103 temperature-sensitive bacteria. These bacteria were obtained from the American Type Culture Collection. The bacterial host allows expression of a beta-galactosidase/peptide antigen fused protein encoded by the vector under temperature induction conditions (Huynh).

Infected bacteria were streaked, grown at 32°C overnight or until colonies were apparent, and individual colonies were replica plated and examined for growth at 32°C and 42°C. Bacterial colonies which grew at 32°C, but not 42°C, indicating integration of the phage genome, were used to inoculate 1 ml of NZYDT (Maniatis) broth. A saturated overnight bacterial culture was used to inoculate a 10 ml culture, which was incubated with aeration to an O.D. of about 0.2 to 0.4, typically requiring 1 hour incubation. The culture was then brought to 43°C quickly in a 43°C water bath and shaken for 15 minutes to induce lambda gt11 peptide synthesis, and incubated further at 37°C for 1 hour.

The cells were pelleted by centrifugation, and 1 ml of the pelleted material was resuspended in 100 µl of lysis buffer (62 mM Tris, pH 7.5 containing 5% mercaptoethanol, 2.4% SDS and 10% glycerol). Aliquots (about 15 µl) were loaded directly onto gels and fractionated by SDS-PAGE. After electrophoresis, the fractionated bands were transferred by electroelution to nitrocellulose filters, according to known methods (Ausubel et al.).

The lysate was treated with DNase I to digest bacterial DNA, as evidenced by a gradual loss of viscosity in the lysate. An aliquot of the material was diluted with Triton X-100™ and sodium dodecyl sulfate (SDS) to a final concentration of 2% Triton X-100™ and 0.5% SDS. Non-solubilized material was removed by centrifugation and the supernatant was fractionated by SDS polyacrylamide electrophoresis (SDS-PAGE). A portion of the gel was stained, to identify the peptide antigen of interest, and the corresponding unstained band was transferred onto a nitrocellulose filter.

The 5-1-1 antigen coding sequence (Example 11) was also expressed as a glutathione-S-transferase fusion protein using the pGEX vector system, according to published methods (Smith). The fusion protein obtained from bacterial lysate and fractionated by SDS-PAGE was transferred to a nitrocellulose filter for Western blotting, as above.

Western blotting was carried out substantially as described in Example 10. Briefly, the filters were blocked with AIB, then reacted with the serum samples identified in Table 5, including human and chimpanzee normal, chronic NANBH, and hepatitis B (HBV) sera samples. The presence of specific antibody binding to the nitrocellulose filters was assayed by further immunobinding of alkaline-phosphatase labelled anti-human IgG. The results of the Western blot analysis with the Sj26/5-1-1 fusion protein and beta-gal/409-1-1(c-a) fusion proteins are shown in Table 6. The data confirm that 409-1-1(c-a) and 5-1-1 peptide antigens are specifically immunoreactive with human and chimpanzee NANBH antisera.

Table 6

                                      # Positive
                                  Sj26/      beta-gal/
Source   Diagnosis   # Donors     5-1-1      409-1-1(c-a)
Human    Normal      2            0          0
Human    NANB        7            5          5
Human    HBV         1            0          0
Chimp    Normal      5            0          0
Chimp    NANB        6            5          5
Chimp    HBV         1            0          0


Example 13
Generation of Alternative Clones
Alternative clones were generated from the region identified in Example 12 as encoding antigen specifically immunoreactive with human and chimpanzee NANBH antisera. The primers shown in Table 7 were selected from the HCV or 409-1-1(abc) coding sequences to generate a variety of overlapping clones.



WO 91/15516 



PCTAJS91/02370 



Table 7

Primer            Sequence
33C-F1            CCGAATTCGCGGTGGACTTTATCCCTGT
33C-R1            CTCAATTCCAGAGCAACCTCCTCGATG
409-1-1(c-a)F     CCGAATTCCGCACGCCCGCCGAGACTAC
409-1-1-F1        CCGAATTCTCCACCACCGGAGAGATCCC
409-1-1-R2        CCGAATTCCACACGTATTGCAGTCTATC
409-1-1-F3        CCGAATTCGTCACCCAGACAGTCGAT
409-1-1-R5        CCGAATTCCCCTCCCAAAATTCAAGATGG
409-1-1(c-a)R     CCGAATTCGCCAGTCCTGCCCCGACGTT
409-1-1CR         CCGAATTCGTCCTGGCACACGGGAAG


The primers shown in Table 7 were used in DNA amplification reactions as described in Examples 7B and 8; the primers and templates used in each reaction are shown in Table 8. The amplified fragments were then treated with the Klenow fragment of DNA polymerase I, under standard conditions (Maniatis et al.), to fill in the ends of the molecules. The blunt-end amplified fragments were digested with EcoRI under standard conditions and cloned into lambda gt11 expression vectors essentially as described in Example 10B. The resulting inserts are aligned for comparison in Figure 7.

Table 8

Generated Fragment   Template              Primers
[illegible]          cDNA*                 33-C-F1 and 409-1-1-R2
[illegible]          cDNA*                 33-C-F1 and 33-C-R1
409-1-1(F1R2)        gt11 409-1-1(c-a)     409-1-1-F1 and 409-1-1-R2
[illegible]          gt11 409-1-1(c-a)     409-1-1-F1 and 409-1-1caR
409-1-1(a)           gt11 409-1-1(c-a)     409-1-1caF and 409-1-1CR
[illegible]          [illegible]           [illegible] and 409-1-1-R2
[illegible]          gt11 409-1-1(c-a)     409-1-1-F3 and 409-1-1caR

* Amplified cDNA fragments from Example 7



Example 13

Immunoscreening of the Alternative Clones
The alternative clones generated in Example 12 were immunoscreened essentially as described in Example 10B. Clones 409-1-1(abc) and 409-1-1(c-a), generated in Example 12, were also included in the following immunoscreenings. The results of the preliminary immunoscreening are shown in Table 9.



Table 9

Clone              GLI-1    FEC
33C                +        ND*
33cu               +        ND
409-1-1(abc)       +        ND
409-1-1(F1R2)      +        ND
409-1-1(a)         +        ND
409-1-1(ca)        +        ND
409-1-1(C)
409-1-1(c+270)     +        ND
409-1-1 u          -

*Not Done

The GLI-1 serum was a human chronic PT-NANBH serum. If a clone tested negative with GLI-1 it was further examined by screening with FEC, a human chronic PT-NANBH serum.

The seven of the 9 alternative clones which tested positive by the above preliminary immunoscreening were more extensively screened against a battery of sera. In addition, clone C100 (see Background) was included in the screening. The results of this more exhaustive screening are presented in Table 10.



Table 10



Serum 



ANTIGEN 



5 








409-1-1 


409-1-1 


409-1-1 


409-1-1 




C100 


33C 


33Cu 


abc 


FIR2 


a 






SKF(-) 


- 


- 


- 


*■ 






10 


FEC< + ) 


+ 


+ 3 


+ 3 


+ 1 


+ 2 


+ 2 




BV 


+ 2 


+ 3 


1 


+ 1 


• * 

+ 1 






Bar 


+ 2 


+ 2 


1 










PP(-) - 


- 


- 


- 










AP 


+ 1 


+ 2 


- 


1 






15 


CP + 


+ 2 


+ 3 


+ 2 


+ 3 


+ 3 


1 

1 




1 
2 


- 
- 


- 
- 


• 
- 








20 


3 


- 


- 


- 


- 


- 






4 


- 


- 


1 


1 


- 






5 




+ 1 












6 


+ 1 


+ 3 


+ 1 


+ 1 


+ 1 






7 


+ 2 


+ 3 


+ 1 


+ 2 


+ 2 




25 


38 




I 


+ 1 


1 


I 






39 




+ 1 


1 


+ 1 








40 + 


+ 1 


+ 2 


+ 1 


+ 1 








41 + 


+ 2 


+ 3 


+ 1 


+ 1 


+ 1 






42 + 


+ 2 


+ 3 


+ 1 


+ 1 


+ 1 




30 


43 
















44 


I 


I 












45 


I 


+ 1 


I 


1 








46 + 


+ 1 


+ 2 


+ 1 


+ 2 


+ 1 






47 + 


+ 1 


+ 2 


+ 2 


+ 2 


+ 3 


m 


35 


B18 - 


+3 


+3 


+1 


+3 


+3 




A7 


+3 


+3 


+1 


+1 


+3 






C7 


+2 


+3 












A3 


+3 


+3 


+1 


+2 


+1 






B7 


+2 


+3 


I 


+3 


+3 




40 


C12 + 


+2 


+3 











409-1-1 5-1-1 

CB 



+ 1 



+ 3 



+ 2 + 2 



+ 2 



+ 1 
+ 2 

I 

+ 1 
+ 2 
+ 2 



+ 1 
+3 
+3 
+3 

+2 
+3 



+ 1 
+ 1 
I 
I 

+ 1 
+ 1 
+ 1 



I 
I 

+1 
+3 



The serum samples used for screening were identified as follows: SKF, PT-NANBH negative; FEC, PT-NANBH positive; BV, community acquired NANBH; Bar, PT-NANBH positive; PP (pre-inoculation pooled chimpanzee serum), PT-NANBH negative; AP (acute HCV pooled chimpanzee serum), PT-NANBH positive; and CP (chronic HCV pooled chimpanzee serum), PT-NANBH positive. The numbered serum samples correspond to human clinical serum samples which were PT-NANBH positive. The PP, CP, and AP sera were pooled sera samples from 5 different chimpanzees; the chimpanzee serum samples were obtained from the Centers for Disease Control. The scoring system presented in Table 10 is a qualitative scoring system defined as follows: (-), a clear negative; (+), (1+), (2+), (3+), increasing strength of positive signal, with (3+) being the strongest signal; and (I) stands for Indeterminate, where two readings were different and not repeated.

In view of the data presented in Table 10 the sensitivity of the antigens in terms of immunoscreening is 33cu > 33c > 409-1-1(c-a) > 409-1-1-F1R2 > 409-1-1(abc) ≥ 409-1-1a > 5-1-1 > 409-1-1-(c+270). Although 33cu and 33c were sensitive antigens, they reacted with high background against all sera. Accordingly, the 409-1-1 series are more useful as diagnostic antigens since they are more specific to HCV induced antibodies.

The immunoscreening was further extended to include the clone 36 and 45 (corresponds to clone 40) encoded epitopes which were identified above. Table 11 shows the results of the immunoscreening.



Table 11 

PANEL I: SEROCONVERSION SPECIMENS 



SERUM 


C-100 


33C 


5.1.1 


ANTIGEN 
409-1-1 

(c-sl) 


36 


45 gtll 


GLI-1 


+ 


4+ 


2+ 


4+ 




3+ 


FEC 


+ 


4+ 


3+ 


4+ 


3+ 




BV 




3+ 




3+ 






SKF(norm) 















1- N01/D69 - I 

2- n /D124 - + 
3- "/D146 -

4- "/D211 - 

5- N00/D22 
5 6- "/D29 

7- "/D41 

8- "/D60 

9- "/D137 + 

10 10-N240/D0 - 

11- »/D45 

12- H /D71 - 

13- M /D89 

14- "/D106 - 
15 15- "/D155 - 

16- N228/D0 

17- "/D31 - 

18- M /D41 - 
20 19- n /D51 - 

20- "/D73 

21- "/D93 

22- "/D127 - 

25 23-N192/D114 

24- M /D184 - 

25- '7D224 - 

26- "/D280 - 

30 27-N176/D0 - 

28- "/D66 

29- n /D77 

30- »'/D94 - 

31- "/D200 - 

35 

32- N170/D0 - 

33- M /D27 

34- W /D49 - 
40 35- n /D64 - 

36- "/D183 - 

37- M /D278 - 



71 



I 










+ 










+ 


I 


I 


- 


- 


2 + 


+ 


2+ 






3+ 


2+ 


3+ 






4 + 


3 + 


4+ 


_ 




4+ 


4+ 


4+ 


2+ 




I 


- 


I 


- 


- 


JL 




X 






I 










I 










I 










I 










I 










I 










I 










I 
SERUM ANTIGEN

C-100 33C 5.1.1 409-1-1 36 45 



gtll 



38- N144/D63 - I - - 

39- "/D72 I 

40- W /D91 + 2+ + 2 + - 
10 41- M /D289 + 4+ + 3+ 2+ 

42- M /D233 + 4+ 3+ 4+ 2+ 



43-N122/D0 




I 








44- M /D51 




I 


I 


I 




45- "/D57 




2+ 


I 


+ 




46- W /D72 


+ 


2+ 




3+ 


I 


47- " "/D94 


+ 


3+ 


+ 


4+ 


+ 


48- M /D199 


+ 


4+ 


2+ 


4+ 


+ 



15 



20 49-N31/D0 

50- "/D140 

51- M /D154 

52- "/D170 

53- W /D210 
25 54- n /D266 

55 - M /D336 

56- "/D394 

57- N16/D0 
30 58- "/D47 

59- M /D62 

60- "/D83 

61- "/D137 
61- "/D167 

35 63- "/D197 

64- M /D370 



The screening sera GLI-1, FEC, BV, and SKF have been defined above. The numbered sera samples correspond to human clinical serum samples which were PT-NANBH positive; these samples were obtained from Dr. Francoise Fabiani-Lunel, Hospital La Pitie Salpetriere, Paris, France. As can be seen from the results presented in Table 11, the antigens produced by clones 36 and 40, while not as sensitive as 409-1-1(c-a), do yield HCV-specific immunopositive signals.
Example 14
T ^»H nB nf 40 P -1-1 Fusion Protein 
Sepharose 4B beads conjugated with anti-beta-galactosidase were purchased from Promega. The beads were packed into a 2 ml column and washed successively with phosphate-buffered saline with 0.02% sodium azide and 10 ml TX buffer (10 mM Tris buffer, pH 7.4, 1% aprotinin).

BNN103 lysogens infected with gt11/409-1-1(c-a) from Example 12 were used to inoculate 500 ml of NZYDT broth. The culture was incubated at 32°C with aeration to an O.D. of about 0.2 to 0.4, then brought to 43°C quickly in a 43°C water bath for 15 minutes to induce gt11 peptide synthesis, and incubated further at 37°C for 1 hour. The cells were pelleted by centrifugation and suspended in 10 ml of lysis buffer (10 mM Tris, pH 7.4 containing 2% Triton X-100 and 1% aprotinin added just before use). The resuspended cells were frozen in liquid nitrogen, then thawed, resulting in substantially complete cell lysis. The lysate was treated with DNase I to digest bacterial and phage DNA, as evidenced by a gradual loss of viscosity in the lysate. Non-solubilized material was removed by centrifugation.

The clarified lysate material was loaded on the Sepharose column, the ends of the column were closed and the column was placed on a rotary shaker for 2 hours at room temperature and 16 hours at 4°C. After the column settled, it was washed with 10 ml of TX buffer. The fused protein was eluted with 0.1 M carbonate/bicarbonate buffer, pH 10. A total of 14 ml of the elution buffer was passed through the column, and the fusion protein eluted in the first 4-6 ml of eluate.

The first 6 ml of eluate from the affinity column were concentrated in Centricon™-30 cartridges (Amicon, Danvers, Mass.). The final protein concentrate was resuspended in 100 µl PBS buffer. Protein purity was analyzed by SDS-PAGE. A single prominent band was observed.
Example 15
Preparation of Anti-409-1-1 Antibody
The 409-1-1(c-a) digest fragments from lambda gt11 were released by EcoRI digestion of the phage, and the "A" region purified by gel electrophoresis. The purified fragment was introduced into the pGEX expression vector (Smith). Expression of glutathione S-transferase fused protein (Sj26 fused protein) containing the 409-1-1(a) peptide antigen was achieved in E. coli strain KM392 (above). The fusion protein was obtained from lysed bacteria and isolated by affinity chromatography on a column packed with glutathione-conjugated beads, according to published methods (Smith).

The purified Sj26/409-1-1(a) fused protein was injected subcutaneously in Freund's adjuvant in a rabbit. Approximately 1 mg of fused protein was injected at days 0 and 21, and rabbit serum was collected on days 42 and 56.

A purified Sj26/5-1-1 fused protein was similarly prepared using an amplified HCV fragment encoding the 5-1-1 fragment. The fused Sj26/5-1-1 protein was used to immunize a second rabbit, following the same immunization schedule. A third rabbit was similarly immunized with purified Sj26 protein obtained from control bacterial lysate.

Minilysates from the following bacterial cultures were prepared as described in Example 12: (1) KM392 cells infected with pGEX, pGEX containing the 5-1-1 insert, and pGEX containing the 409-1-1(a) insert; and (2) BNN103 infected with lambda gt11 containing the 5-1-1 insert and gt11 containing the 409-1-1(c-a) insert. The minilysates were fractionated by SDS-PAGE, and the bands transferred to nitrocellulose filters for Western blotting as described in Example 12. Table 12 shows the pattern of immunoreaction which was observed when the five lysate preparations (containing the antigens shown at the left in the table) were screened with each of the three rabbit immune sera. Summarizing the results, serum from control (Sj26) rabbits was immunoreactive with each of the Sj26 and Sj26 fused protein antigens. Serum from the animal immunized with Sj26/5-1-1 fused protein was reactive with all three Sj26 antigens and with the beta-gal/5-1-1 fusion protein, indicating the presence of specific immunoreaction with the 5-1-1 antigen. Serum from the animal immunized with Sj26/409-1-1(a) fused protein was reactive with all three Sj26 antigens and with the beta-gal/409-1-1(c-a) fusion protein, indicating the presence of specific immunoreaction with the 409-1-1(a) antigen. None of the sera were immunoreactive with beta-galactosidase (obtained from a commercial source).



Table 12

                        Antibody
Antigens     Sj26     5-1-1/Sj26     409-1-1(a)/Sj26


Sj26 + + 

5-1-1/ + 
(Sj26) + 

5-1-1/ . 

409-1-1 (a) + 
(Sj26) + 

409-1-1 (c-a) + 
(/?-gal) 

Anti-409-1-1(a) antibody present in the sera from the animal immunized with the Sj26/409-1-1(a) fused protein was purified by affinity chromatography, following the procedure described in Example 12, but where the ligand derivatized to the Sepharose beads is the purified 409-1-1(c-a) fusion protein, rather than the anti-beta-galactosidase antibody.
Example 16

Cloning the HCV Capsid Protein Coding Sequences
This example describes the cloning of HCV coding sequences which encode the N-terminal region of the HCV capsid protein.

The protein sequence of the HCV-capsid associated antigen corresponds to the nucleotide residues 325-970 of the full length HCV sequence (see Appendix A). The following sequences were used as PCR primers to clone this region: SF2(C), 5' end starting at nucleotide 325 of the full length HCV sequence (Appendix), 5'-GCGCCCATGGGCACGATTCCCAAACCTCA; and SR1(C), 3' end starting at nucleotide 969 of the full length HCV sequence (Appendix), 5'-GCCGGATCCCTATTACTC(G/A)TACACAAT(A/G)CT(C/T)GAGTT(A/G)G. The anticipated size of the fragment generated using the SF2(C)/SR1(C) primer pair was 644 base pairs.
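
The degenerate positions in the SR1(C) primer can be made concrete with the following Python sketch, which expands the "(X/Y)" notation into the individual oligonucleotides it represents and checks each primer for the restriction site used later in this example for cloning (NcoI, CCATGG; BamHI, GGATCC). The primer strings are copied from the text as read here; the function name expand and the parsing of the "(X/Y)" notation are assumptions made for illustration.

```python
import itertools
import re

# Primer sequences as given in the text (5' to 3').
SF2_C = "GCGCCCATGGGCACGATTCCCAAACCTCA"
SR1_C = "GCCGGATCCCTATTACTC(G/A)TACACAAT(A/G)CT(C/T)GAGTT(A/G)G"

NCOI_SITE = "CCATGG"   # NcoI recognition sequence
BAMHI_SITE = "GGATCC"  # BamHI recognition sequence


def expand(primer):
    """List every sequence encoded by a primer written with (X/Y) choices."""
    # re.split with a capturing group alternates literal text and choices.
    parts = re.split(r"\(([ACGT](?:/[ACGT])+)\)", primer)
    options = [
        part.split("/") if index % 2 else [part]
        for index, part in enumerate(parts)
    ]
    return ["".join(combo) for combo in itertools.product(*options)]


variants = expand(SR1_C)
print("SR1(C) variants:", len(variants))
print("NcoI site in SF2(C):", NCOI_SITE in SF2_C)
print("BamHI site in every SR1(C) variant:",
      all(BAMHI_SITE in v for v in variants))
# The stated fragment size equals the difference of the primer coordinates.
print("969 - 325 =", 969 - 325)
```

Four two-fold degenerate positions give sixteen distinct reverse primers, so the SR1(C) reagent is a mixture rather than a single oligonucleotide.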

SISPA-amplified cDNA fragments from Example 7 were mixed with 100 µl Buffer A, 1 µM of equal molar amounts of each SF2(C) and SR1(C) primer given above, 200 µM each of dATP, dCTP, dGTP, and dTTP, and 2.5 units of Thermus aquaticus DNA polymerase (Taq polymerase), as in Example 8.

Specific amplification of the SISPA-amplified cDNA fragments with the capsid primer pair given above was carried out under conditions similar to those described in Example 7, with 1 minute at 72°C and about 30 cycles.

The amplified fragment mixtures from above were each fractionated by agarose gel electrophoresis on duplicate 1.2% agarose gels, and one of the gels transferred to nitrocellulose filters (Southern) for hybridization with a radioactively labelled oligonucleotide (Southern) having the following sequence: SF3(M/E), 5' end starting at nucleotide 792 of the full length HCV sequence (Appendix), 5'-GCGCCCATGGTTCTGGAAGACGGCGTG. This oligonucleotide corresponds to a sequence internal to the amplification product generated by using the SF2(C) and SR1(C) primers. Eight out of 15 PCR products were identified which gave a positive hybridization signal with the internal probe.

The vectors pGEX (Example 15) and pET (NOVAGEN, 565
Science Drive, Madison, WI 53711) were chosen for bacterial
expression of protein sequences encoded by the inserts.
The pGEX vector provided expression of the inserted coding
sequences as fusion proteins to Sj26 (see Examples 12 and
15) and the pET vector provided expression of the cloned
sequences alone. To clone the capsid sequences, the
amplification product bands were excised from the duplicate
gel. The DNA was extracted from the agarose and doubly
digested with NcoI and BamHI. A pGEX vector containing the
BamHI/NcoI cloning sites was also doubly digested with
BamHI and NcoI. The vector and extracted DNA were then
ligated under standard conditions and the ligation mixture
transformed into bacterial cells.

The bacterial transformants were cultured under
ampicillin selection, and the plasmid DNA isolated by
alkaline lysis (Maniatis et al.). The isolated plasmid DNA
was digested with NcoI and BamHI. The digestion products
were then electrophoretically separated on an agarose gel.
The gel was transferred to nitrocellulose and probed with
radioactively labelled SF3 as above. Twelve clones were
confirmed to have the insert of interest by the Southern
blot analysis.

Clones were generated in the pET vector in essentially 
the same manner. 
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Example 17 

Immunological Screening of the Putative HCV
Capsid Protein Clones
This example describes the immunological screening of
the putative HCV capsid protein clones which were obtained
in Example 16.

Of the twelve clones obtained in Example 16, protein 
mini-lysates of 7 clones (clones #8, 14, 15, 56, 60, 65, 
and 66) were prepared as described in Example 12. These 
mini-lysates were fractionated as described and transferred 
to nitrocellulose for Western Blot analysis. Table 13 
shows the pattern of immunoreaction which was observed when 
the 7 lysate preparations were screened with the indicated 
sera.

Table 13

          Sera
Clone     SKF    FEL    A6     B9     BV
8
14               +      +      +      +
15               +      +      +      +
56               +             +      +
60               +      +
65               +      +      +      +
SJ26
5-1-1                   +      +
409-1-1

The serum samples used for screening were identified 
as follows: SKF, HCV negative; EEC, HCV positive; BV, 
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community acquired HCV; A6 and B9 correspond to human 
clinical serum samples which were HCV positive. 

Immunoreactive bands identified on the Western blot
were all smaller than the expected size of 50 kd (based on
the predicted coding sequence of the cloned inserts, see
below).

Clone 15 was chosen for scale-up production of the
Sj26 fusion protein (Smith et al.). A one liter preparation
of clone 15 yielded about 200 µg of purified immunoreactive
material. The bulk of the immunoreactive material
appeared in a major doublet band which ran at approximately
29 kd. The yield from this preparation was unexpectedly
low: typically with the pGEX system a one liter protein
preparation yields in the range of 50-100 mg fusion
protein.

Example 18 

Nucleic Acid Sequences of Clones 15 and 56 
The inserts of clones 15 and 56 (discussed in Example
17) were sequenced as per the manufacturer's instructions
(US Biochemical Corporation, Cleveland OH) using the
dideoxy chain termination technique (Sanger, 1979). Each
of the clones had an open reading frame contiguous with the
Sj26 reading frame of the pGEX vector. The sequences of
the clone inserts were nearly identical, with only a few
minor sequence variations: the sequence of clone 15 had a
termination codon starting at nucleotide position 126. The
sequence data for clone 56 is presented as SEQ ID NO: 11 and
in Figure 8A.
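
A premature termination codon of the kind reported for clone 15 can be located by walking an insert sequence codon by codon in the reading frame set by the vector. The sketch below is a minimal illustration in Python; the function name and the toy sequence are hypothetical, and the actual clone 15 sequence is not reproduced here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def first_in_frame_stop(seq: str, frame_start: int = 1):
    """Return the 1-based position of the first in-frame stop codon, or None.

    frame_start is the 1-based position of the first base of the first codon.
    """
    seq = seq.upper()
    for i in range(frame_start - 1, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return i + 1
    return None

# Toy sequence (not from the patent): ATG AGC ACG AAT CCT TAA ACC
toy_insert = "ATGAGCACGAATCCTTAAACC"
print(first_in_frame_stop(toy_insert))
```

Only codons in the chosen frame are examined, so stop triplets that occur out of frame are ignored.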

The sequencing of the inserts revealed the unusual
feature of a run of adenine residues from nucleotide
position 25 to position 34 (Figure 8A): such sequences are
similar to sequences known to promote translation
frameshifting (Wilson et al., Atkins et al.). The open
reading frame contiguous with the Sj26 coding sequence
predicts a protein of approximately 23.5 kd. Accordingly,
given the approximately 26 kd size of the Sj26 protein
fragment in this construct (Smith et al.), the complete
fusion protein would be predicted to be approximately 50 kd.
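
The size prediction above is a simple sum of the insert-encoded portion and the Sj26 carrier. As a rough sketch in Python (the helper names are hypothetical, and the 110 Da average residue mass is a common rule of thumb assumed here, not a value from the text):

```python
SJ26_KD = 26.0  # approximate size of the Sj26 fragment, as stated in the text

def predicted_fusion_kd(insert_kd: float, carrier_kd: float = SJ26_KD) -> float:
    """Predicted size of the fusion protein: insert-encoded part plus carrier."""
    return insert_kd + carrier_kd

def orf_kd(n_codons: int, avg_residue_da: float = 110.0) -> float:
    """Rough mass estimate from a residue count (assumed 110 Da per residue)."""
    return n_codons * avg_residue_da / 1000.0

print(predicted_fusion_kd(23.5))  # insert-encoded 23.5 kd plus 26 kd carrier
```

An estimate of this kind gives only an approximate expectation; observed mobility on a gel can differ from the calculated mass.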

Example 19

Hydropathicity Plot of the Protein Produced by Clone 56

The SOAP program from the IntelliGenetics PC/GENE™
software package was used to generate the hydropathicity
plot of Figure 9. The SOAP program uses the method of Kyte
et al. to plot the hydropathicity of the protein along its
sequence. The interval used for the computation was 11
amino acids. In Figure 9, the hydrophobic side of the plot
corresponds to the positive values range and the
hydrophilic side to the negative values range.
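
A hydropathicity profile of this kind can be approximated with a sliding-window average over the Kyte and Doolittle scale. The Python sketch below is not the SOAP program itself; it is a minimal illustration that assumes a plain unweighted mean over an 11-residue window, and the example peptide is an arbitrary toy string rather than the clone 56 sequence.

```python
# Kyte-Doolittle hydropathy values, one-letter amino acid codes.
KD_SCALE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(protein: str, window: int = 11) -> list[float]:
    """Mean hydropathy of each window; one value per window start position."""
    scores = [KD_SCALE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]

# Arbitrary toy peptide: a charged stretch followed by a hydrophobic stretch.
toy_peptide = "MKKKRRDDEEKAILLVVFFIILLAAG"
profile = hydropathy_profile(toy_peptide)
print(round(profile[0], 2), round(profile[-1], 2))
```

Positive window means indicate hydrophobic stretches and negative means hydrophilic ones, matching the sign convention described for Figure 9.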

The hydropathicity plot indicates (i) the hydrophilic
nature of the amino terminus of the capsid protein, (ii)
the relatively hydrophobic nature of the region of amino
acid residues approximately 122 to 162, and (iii) the
hydrophobic nature of amino acid residues approximately
168-182.

Further, the region of amino acid residues 168-182
demonstrates potential for being a membrane spanning
segment (Klein et al.).

Example 20

Deletion Analysis of the Clone 56 Protein Coding Region

This example describes the generation of a series of
carboxy and amino terminal deletions of the HCV capsid
protein and the effect of these deletions on the
immunoreactivity of the resulting proteins.

A. Carboxy Terminal Deletions of Clone 56. 
As one step to improve the expression of the HCV
capsid protein, the putative region of translational
frameshifting was modified to reduce the probability of a
frameshift occurring in this region. In each AAA codon
encoding lysine (nucleotide positions 25 to 33, Figure 8A),
the third nucleotide was changed from A to G using standard
techniques (Ausubel et al.; Mullis; Mullis et al.). The
sites of these substitutions are indicated in Figure 8A by
the three G's placed over the corresponding A's. The
sequence of the modified pGEX clone was confirmed as in
Example 19 and the clone was named pGEX-CapA. The insert
sequence of clone pGEX-CapA is shown in Figure 8B and
presented as SEQ ID NO: 13.
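
The A to G changes described above are silent, since AAA and AAG both encode lysine, but they interrupt the uninterrupted run of adenines. A minimal Python sketch of the substitution follows; the function name and toy sequence are hypothetical, and it is assumed that position 25 is the first base of a codon, as the text implies.

```python
def break_lysine_run(seq: str, start: int, end: int) -> str:
    """Change the third base of each AAA codon in [start, end] to G.

    start and end are 1-based and inclusive; start must be the first base
    of a codon. AAA and AAG both encode lysine, so the protein is unchanged.
    """
    bases = list(seq.upper())
    for i in range(start - 1, end - 2, 3):
        if bases[i:i + 3] == ["A", "A", "A"]:
            bases[i + 2] = "G"
    return "".join(bases)

# Toy insert: 24 bases, then a run of ten A's (positions 25-34), then a tail.
toy_insert = "ATG" + "GCT" * 7 + "A" * 10 + "CCTCAA"
modified = break_lysine_run(toy_insert, 25, 33)
print(modified[24:34])
```

After the change the longest run of consecutive adenines in the window is shortened from ten to two, which is the property the modification is meant to achieve.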

The deletion clones were generated using the
primers given in Table 14. In Table 14 the BamHI site is
italicized and the termination codon is underlined.

Table 14

CARBOXY TERMINAL DELETION PRIMERS

C1      5'-CGA TCC ATG GGC ACG AAT CCT AAA CC
NC580   5'-G GCC GGA TCC TTA GGC CGA AGC GGG CAC AG
NC520   5'-G GCC GGA TCC TTA ACC AGG AAG GTT CCC TGT TCC
NC450   5'-G GCC GGA TCC TTA [illegible] CCT GGC ACG GCC TCC
NC360   5'-G GCC GGA TCC TTA CAA ATT GCG CGA CCT ACG CC
NC270   5'-G CCC GGA TCC TTA GCC CTC ATT GCC ATA GAG

Amplification reactions were carried out essentially
as described in Example 16 using primer C1 paired with each
of the NC primers and purified plasmid pGEX-CapA as
template, with annealing for 2 minutes at 50° and 3 minutes
at 72°.

The following sequence comparisons are given relative
to the pGEX-CapA insert sequence presented in Figure 8B.
The C1 primer corresponds to the 5' end of the insert,
which contains an NcoI site near the initiating
methionine. The sequence of each NC primer ends at
the nucleotide position indicated; for example, the
homologous sequence of the NC580 primer ends at position
580. A termination codon is inserted at that position,
following a BamHI site. The positions of the primers
given in Table 14 are indicated in Figure 8B. The
approximate locations of the primers relative to the
protein sequence are indicated in Figure 9.
The resulting amplification products were
electrophoretically size fractionated on a polyacrylamide
gel and the DNA products of the appropriate sizes
electroeluted from the gel. The amplification products
were cloned into both the pGEX and the pET vectors for
expression. The sequences of the inserts were confirmed as
described in Example 18.

The pGEX vectors containing the carboxy-terminal
deletions were transformed into E. coli and the fusion
proteins purified essentially as follows. Expression of the
fusion protein was induced with IPTG for 3-4 hours. The
cells were then harvested at 6,000 rpm for 10 minutes. The
E. coli were then lysed in MTPBS buffer (150 mM NaCl; 16 mM
Na2HPO4; 4 mM NaH2PO4, pH=8.0), after which 1% "TRITON
X-100," 3 µg/ml DNase I, and 1 mM PMSF were added. The
lysates were centrifuged at 15,000 rpm for 20 minutes. The
supernatants were discarded and the pellets resuspended in
8M urea. The components of the resuspension were separated
by HPLC using a "BIO-GEL SP-5-PW" column. Typically, the
fusion protein was the predominant peak: the location of
the fusion protein was confirmed by Western Blot analysis.
Clones C1NC270, C1NC360, and C1NC450 all expressed Sj26
fusion proteins at high levels: the fusion proteins all
corresponded to the size predicted from the insert coding
sequence fused to the Sj26 protein and were immunoreactive
with HCV-positive sera (Western Blots were performed as
described in Example 17). Although the supernatants were
discarded, substantial amounts of the fusion proteins were
also present in the supernatants. Clones C1NC520 and
C1NC580 gave poor yields of fusion proteins.

An epitope map of the HCV capsid region is presented
in Figure 10: the locations of the immunoreactive protein
coding sequences corresponding to inserts C1NC450, C1NC360,
and C1NC270 are indicated. The sequences of C1NC450,
C1NC360, and C1NC270 are presented in the Sequence Listing
as SEQ ID NO:15, SEQ ID NO:17, and SEQ ID NO:19,
respectively.

B. Amino Terminal Deletions of Clone 56.
Amino terminal deletion clones were generated using
the PCR primers given in Table 15.

Table 15
AMINO TERMINAL DELETION PRIMERS
1. C100   GAG CCC ATG GGT GGA GTT TAC TTG TTG CC
2. C270   GAG CCC ATG GGC TGC GGG TGG GCG GG
3. C360   GAG CCC ATG GGT AAG GTC ATC GAT ACC

Amplification reactions were carried out essentially
as described above using the primer pairs presented in
Table 16 and purified plasmid pGEX-CapA as template; the
amplification reaction included 1 minute at 95°,
annealing for 2 minutes at 50°, and 3 minutes at 72°, for
20 cycles.

Table 16 

NH2 Primer   COOH Primer   Protein Produced?   Immunoreactive?

25 C100 NC450 LOW g 

NC360 YES yES 

NC270 YES 

VP e NO 
30 C270 NC450 NQ 



C360 



NC360 YES 
NC450 YES 



NO 



The following sequence comparisons are given relative
to the nucleic acid sequence presented in Figure 8B, where
the above described A to G substitutions have been made for
the sequence of pGEX-CapA. The NC660 primer corresponds to
the common 3' end of the pGEX-CapA insert, which contains a
BamHI site near the end of the insert. The sequences of the
C primers each start at the nucleotide position indicated;
for example, the sequence of the C100 primer begins at
nucleotide position 100. Each of the C primers introduces
an in-frame initiation codon in the resulting amplification
product. The positions of the primers given in Table 15
are indicated in Figure 8B.
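
The initiation codon introduced by each C primer sits inside the NcoI recognition site (CCATGG), so the cloning site and the new start codon can be checked together. The following Python sketch is illustrative only; the primer strings are the Table 15 sequences with spaces removed, and the helper name is hypothetical.

```python
NCOI_SITE = "CCATGG"

# Table 15 primers, written without the codon spacing.
C_PRIMERS = {
    "C100": "GAGCCCATGGGTGGAGTTTACTTGTTGCC",
    "C270": "GAGCCCATGGGCTGCGGGTGGGCGGG",
    "C360": "GAGCCCATGGGTAAGGTCATCGATACC",
}

def start_codon_index(primer: str):
    """Return the 0-based index of the ATG inside the NcoI site, or None."""
    site = primer.find(NCOI_SITE)
    if site < 0:
        return None
    return site + 2  # CCATGG: the ATG begins two bases into the site

for name, primer in C_PRIMERS.items():
    atg = start_codon_index(primer)
    # Codons read from the new ATG; a trailing partial codon is dropped.
    codons = [primer[i:i + 3] for i in range(atg, len(primer) - 2, 3)]
    print(name, codons)
```

Reading triplets from the ATG shows the codons each primer contributes to the start of the truncated coding region.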

The resulting amplification products were cloned into
the pGEX and pET vectors for expression as described above.
The sequences of the inserts were confirmed.

The pGEX vectors containing the amino-terminal
deletions were transformed into E. coli, protein
mini-lysates prepared, and the immunoreactivity of the
proteins analyzed by Western Blots as described above. The
results of the analysis are presented in Table 16. Clones
C100NC270 and C100NC360 expressed Sj26 fusion proteins at
high levels: the fusion proteins corresponded to the size
predicted from the insert coding sequence fused to the Sj26
protein.

An epitope map of the HCV capsid region is presented
in Figure 10: the locations of the protein coding sequences
corresponding to inserts C100NC270, C100NC360, C270NC360,
and C270NC450 are indicated. The sequences for C100NC270
and C100NC360 are presented in the Sequence Listing as SEQ
ID NO: 21 and SEQ ID NO: 23, respectively.

Example 21 

Expanded Immunoscreening Using the Capsid Antigen
This example describes three different comparisons of
the immunoreactivity of the various HCV antigens of the
present invention to several batteries of sera.
A. Effectiveness of Cap450 Antigen.
Table 17 shows the results for 50 human sera samples
from patients suspected of NANB hepatitis infection. The
ELISA assays were performed essentially as described by
Tijssen using the following antigens: C100, 409-1-1 (c-a),
C33u, Cap450 (the protein product of the pGEX-C1NC450
clone), and 409-1-1 (c-a) and Cap450 combined in one well,
which was optimized to give the most sensitive results.
These ELISA data were compared with the Abbott C100 test.
Patient serum was scored positive for Sj26 fusion
proteins (409-1-1 c-a, C33u, 5-1-1, and Cap450) if the
absorbance was three times the absorbance of that serum on
Sj26 native protein. A sample was scored positive on pET
antigens (Cap360) if the absorbance was three times the
mean of the absorbance of negative control sera. A patient
serum was scored positive on the combined 409-1-1
c-a/Cap450 assay if the absorbance was equal to or greater
than that of control positive sera. Samples within 10% of
the control positive sera were scored weak positives.
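
The scoring rules above can be written out as three small decision functions. This Python sketch is one possible reading, not the assay protocol itself: it assumes the three-fold thresholds are inclusive, and it assumes "within 10%" means an absorbance between 90% and 100% of the positive control. All function names and absorbance values are hypothetical.

```python
from statistics import mean

def score_fusion_antigen(a_antigen: float, a_sj26: float, factor: float = 3.0) -> bool:
    """Sj26 fusion antigens: compare with the same serum on native Sj26."""
    return a_antigen >= factor * a_sj26

def score_pet_antigen(a_antigen: float, negative_controls: list[float],
                      factor: float = 3.0) -> bool:
    """pET antigens: compare with the mean of the negative control sera."""
    return a_antigen >= factor * mean(negative_controls)

def score_combined(a_sample: float, a_positive_control: float,
                   weak_margin: float = 0.10) -> str:
    """Combined-well assay: '+', 'w+' (weak positive), or '-'."""
    if a_sample >= a_positive_control:
        return "+"
    if a_sample >= (1.0 - weak_margin) * a_positive_control:
        return "w+"
    return "-"

# Made-up absorbance readings, for illustration only.
print(score_fusion_antigen(1.0, 0.3))
print(score_pet_antigen(0.7, [0.1, 0.2, 0.3]))
print(score_combined(0.95, 1.0))
```

Keeping the three rules separate mirrors the text, where each antigen class is normalized against a different reference.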

Samples 1-19: Chronic active hepatitis proven by biopsy.
HBsAg(-).

Samples 20-44: Acute viral hepatitis. HBsAg(-), IgM
anti-HBc(-), IgM anti-HAV(-).

Samples 45-50: Chronic active hepatitis proven by biopsy.
HBsAg(-).



Table 17 
Korean Panel II 







Sample 
# 


C100 


409-1-1 
(c-a) 


C33u 


Cap 
450 


Combined 
409-1-1 (c-a) 
+CAP450 




1 






+ 


+ 


+ 


+ 


30 


2 




+ 




+ 


+ 


+ 


* 


3 




+ 


+ 


+ 


+ 


+ 




4 




+ . 


+ 


+ 


+ 


+ 




5 




+ 


+ 


+ 


+ 


+ 




6 






+ 


+ 


+ 


+ 


35 


7 




+ 


+ 


+ . 


+ 


+ 




8 




+ 




+ 


+ 


+ 
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15 







The results demonstrate that the Cap450 protein has
good sensitivity for detecting the presence of anti-HCV
antibodies in sera samples. Three additional samples (6, 37,
and 47) were detected. Further, these results indicate
that the combination of Cap450 and 409-1-1 (c-a) can be used
to produce a kit which is very effective for detection of
anti-HCV antibodies in human sera samples.

B. Cap450 and Cap360. 
The results in Table 18 demonstrate the effectiveness
of the Cap450 and Cap360 antigens (Cap360 being the protein
product encoded by pET-C1NC360) to detect HCV antibodies
present in human sera. The samples were tested for the
presence of HCV by ELISA using each individual antigen
shown, or with 409-1-1 (c-a) and Cap450 antigens combined
in one well.

Table 18 



SERUM 


PATIENT DIAGNOSIS


C100 


ELISA
5-1-1


409-1-1
(c-a)


C33u 


Cap 
360 


Combined 
(409-1-1) 


G-131 


Acute Hepatitis; Pt "CO. " 














G-132 


Acute Hepatitis; Pt "CO." 














G-143 


Acute Hepatitis; Pt "CO." 














G-285 


Acute Hepatitis; Pt "CO." 


ND 


ND 


ND 


ND 


ND 




G-150 


Acute P.T. Hepatitis; Pt 
"G.L. " 






I 


I 


+ 


+ 


G-151 


Acute P.T. Hepatitis; Pt 
"G.L." 






I 




+ 




G-152 


Acute P.T. Hepatitis; Pt 
"G.L." 










+ 


+ 


G-153 


Acute P.T. Hepatitis; Pt
"G.L."






I 




+ 


+ 


G-286 


Acute P.T. Hepatitis; Pt 
"G.L." 


ND 


ND 


ND 


ND 


ND 


+ 


G-43 


Fulminant Liver Disease 














G-1


Community Acquired 
Hepatitis 


ND 


I 


+ 


+ 


+ 


+ 


G-109 


Community Acquired 
Hepatitis 


+ 






+ 


+ 


+ 


G-114 


Community Acquired 
Hepatitis 


ND 












G-128 


Community Acquired 
Hepatitis 


+ 




I 


+ 






G-3 


Community Acquired 
Hepatitis 
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SERUM 


PATIENT DIAGNOSIS 


C100 


ELISA
5-1-1 


409-1-1 
(c-a) 


C33u 


cap 
360 


Combined 
(409-1-1)
+Cap450 


G-126 


Community Acquired 
Hepatitis 


- 


- 


- 


- 


- 


+ 


G-127 


Community Acquired 
Hepatitis 


+ 


I 


+ 




- 




G-42 


Idiopath. Comm. Ac. 
Hepatitis 


- 


- 


- 


- 


- 


- 


G-51 


Community Acquired 
Hepatitis B 


- 


- 


+ 


+ 


+ 


+ 


G-27 


Community Acquired 

Hepatitis B














G-22* 


Community Acquired 
Hepatitis B














G-40 


Community Acquired 
Hepatitis B 


- 


- 


- 


- 


— 


— 


G-31 


Community Acquired 
Hepatitis B


4 




4 


+ 


+ 




G-45 


Community Acquired 
Hepatitis B 


- 


- 


- 


- 


- 


- 


G-38 


Fulminant Hepatitis B 


- 


- 


- 


- 


- 


- 


G-41 


Community Acquired 
Hepatitis C






T 

X 


4 


I 


4 


G-13 


Hepatitis C 




1 


T 


+ 


4. 


4 


G-12 


Hepatitis C 


+ 




+ 




4 


4 


G-6 


Hepatitis C 














G-49 


EtOH Cirrhosis 














G-25 


EtOH Cirrhosis 














G-110 


EtOH Cirrhosis 








4. 


T 
X 


4 


G-46 


EtOH Cirrhosis 




a 










G-272 


Infant Liver Transplant 


ND 












G-274 


Infant Liver Transplant 


ND 




+ 


+ 






G-16 


PBC 






+ 


+ 






G-123


INC LT 














G-122 


INC LT 




+ 










G-125 


No Diagnosis 




I 




+ 


I 


+ 


G-124 


No Diagnosis 








+ 


+ 


4- 



These results indicate that the combination of antigen
409-1-1 (c-a) and Cap360 or Cap450 results in an effective
diagnostic tool for detection of HCV infection. Five
additional samples (G150, G151, G110, G125, and G124) were
detected with these ELISAs compared with the C100 test alone.
C. pET360 

The results in Table 19 demonstrate the effectiveness 
of the pET360 to detect HCV antibodies present in human 
sera. The samples were tested for the presence of HCV by 
ELISA using each individual antigen shown, or with 409-1-1 
(c-a) and pET360 antigens combined in one well. 

Table 19

                                                    Combined
                        409-1-1                     409-1-1 (c-a)
        C100    5-1-1   (c-a)     C33u    pET360    + pET360
A       +               +         +                 +
B       +       +       +         +                 +
C       +                         +                 +
D               +       +                 +         +
E       +               w+        +       +         +
F
G       +               w+        +       +         +
H                       +         +       +
I
J
K                                 +       +         +
L
M
N               w+                        +         +
O       +       w+      +         +       +         +
P       +       w+      +         +       +         +
Q
R
S

These results indicate that the combination of antigen 409-1-1
(c-a) and pET360 results in an effective diagnostic tool for
detection of HCV infection. Three additional samples were detected
with these ELISAs compared with the C100 test alone.
Although the invention has been described with reference to
particular [...] uses, it will be [...] various changes and modifica-
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SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: Reyes, Gregory
Kim, Jungsuh P.
Moeckli, Randolph
Simonsen, Christian C.

(ii) TITLE OF INVENTION: Hepatitis C Virus Epitopes

(iii) NUMBER OF SEQUENCES: 26

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Peter J. Dehlinger
(B) STREET: 350 Cambridge Ave., Suite 100
(C) CITY: Palo Alto
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 94306

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, ver

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 505,611
(B) FILING DATE: 06-APR-1990

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 594,854
(B) FILING DATE: 09-OCT-1990

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Fabian, Gary R.
(B) REGISTRATION NUMBER: 33,875
(C) REFERENCE/DOCKET NUMBER: 4600-076.21

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 323-8302

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hepatitis C Virus

(B) STRAIN: CDC

(vii) IMMEDIATE SOURCE:

(B) CLONE: 304-12-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..561 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

GAA TTC CTC GTG CAA GCG TGG AAG TCC ... ... ... ... ... ... ...  48
Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe
1 5 10 15

TCG TAT GAT ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC GAC ATC  96
Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp Ile
20 25 30

CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC  144
Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln Ala
35 40 45

CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT  192
Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro
50 55 60

CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG  240
Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala
65 70 75 80

AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC  288
Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
85 90 95

AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC GAG GAC TGC ACC ATG  336
Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr Met
100 105 110

CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC  384
Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly Val
115 120 125

CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG  432
Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg
130 135 140

TAC TCC GCC CCC CCC GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG  480
Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu Glu
145 150 155 160

CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT  528
Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala
165 170 175

GGA AAG AGG GTC TAC TAC CTC ACC CGG GAA TTC  561
Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe
180 185



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe
1 5 10 15

Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp Ile
20 25 30

Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp Leu Asp Pro Gln Ala
35 40 45

Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro
50 55 60

Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala
65 70 75 80

Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
85 90 95

Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gln Asp Cys Thr Met
100 105 110

Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys Glu Ser Ala Gly Val
115 120 125

Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg
130 135 140

Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro Glu Tyr Asp Leu Glu
145 150 155 160

Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala
165 170 175

Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe
180 185
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(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hepatitis C Virus

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 303-1-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..252 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA CCA ATA GAC  48
Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro Ile Asp
1 5 10 15

ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT CAG CCT GAG AAG  96
Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys
20 25 30

GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG GGC GTG  144
Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val
35 40 45

CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT ACC AAG CTC CCC  192
Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro
50 55 60

TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA CAG  240
Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln
65 70 75 80

CGG GTT GAA TTC  252
Arg Val Glu Phe



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro Ile Asp
1 5 10 15

Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln Pro Glu Lys
20 25 30

Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu Gly Val
35 40 45

Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro
50 55 60

Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly Gln
65 70 75 80

Arg Val Glu Phe






(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1512 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hepatitis C Virus

(vii) IMMEDIATE SOURCE:

(B) CLONE: 303-1-4

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1512

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

« « » « » » « - z z z z z z z z 

8lu Phe Phe Thr ol« I« Asp Gly Val Arg L ^ 



1 5 



~ rw TCA TTC AGA GTA GGA CTC 

20 25 



================ 

AO 



40 



================ 



SO 55 



GAG GCG GCC GGG - - - - - - » " £ Z " S 

Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly ^ 80 



48 



96 



144 



192 



240 



70 
65 
AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC AAG GCA ACT TGC  288
Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys
85 90 95

ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG CTC ATA GAG GCC AAC CTC  336
Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu
100 105 110

CTA TGG AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT GAG TCA GAA  384
Leu Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu
115 120 125

... AAA GTG GTG ATT CTG GAC TCC TTC GAT CCG CTT GTG GCG GAG GAG  432
Asn Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu
130 135 140

GAC GAG CGG GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG TCT CGG  480
Asp Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg
145 150 155 160

AGA TTC GCC CAG GCC CTG CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC  528
Arg Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro
165 170 175

CCG CTA GTG GAG ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC  576
Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val
180 185 190

... GGC TGT CCG CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT  624
His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro
195 200 205

CGG AAG AAG CGG ACC GTG GTC CTC ACT GAA TCA ACC CTA ... ACT GCC  672
Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala
210 215 220

... GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC  720
Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly
225 230 235 240

ATT ACG GGC GAC AAT ACG ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC  768
Ile Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly
245 250 255
TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC CTG  816
Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu
260 265 270

GAG GGG GAG CCT GGG GAT CCG GAT CTT AGC GAC GGG TCA TGG TCA ACG  864
Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr
275 280 285

GTC AGT AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC TCA ATG TCT  912
Val Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser
290 295 300

TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC GCG GAA GAA CAG  960
Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln
305 310 315 320

AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC CAC AAT  1008
Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn
325 330 335

TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT TGC CAA AGG CAG AAG AAA  1056
Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys
340 345 350

GTC ACA TTT GAC AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC GTA  1104
Val Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val
355 360 365

CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA GTG AAG GCT AAC TTG CTA  1152
Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu
370 375 380

TCC GTA GAG GAA GCT TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC  1200
Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
385 390 395 400

AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC  1248
Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala
405 410 415

GTA ACC CAC ATC AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA  1296
Val Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val
420 425 430



PCT/US91/02370 
WO 91/15516 



101 



ACA CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT  1344
Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val
435 440 445

CAG CCT GAG AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC  1392
Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro
450 455 460

GAT CTG GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT  1440
Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
465 470 475 480

ACC AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC  1488
Thr Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr
485 490 495

TCA CCA GGA CAG CGG GTT GAA TTC  1512
Ser Pro Gly Gln Arg Val Glu Phe
500



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

«. Phe Phe Thr 01. Leu A.P «, V.1 tt. Bis tt, lb. »U «■ 



5 



„ on «. P~ «- Leu «, 01, U. Va! Ser Phe X, v.l «» - 

20 25 

HU =!« Tyr Pro VI Gl, ser «, M Pro *. «1« Pro Pro >*P 

35 * U 
«1 Ma V.! - » Ser Het M Thr M p Pro Ser His He Thr Ma 



1344 



1392 



1440 



14S8 



1512 



50 



55 60 
Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala
65 70 75 80

Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys
85 90 95

Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu
100 105 110

Leu Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu
115 120 125

Asn Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu
130 135 140

Asp Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg
145 150 155 160

Arg Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro
165 170 175

Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val
180 185 190

His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro
195 200 205

Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala
210 215 220

!« Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly 
225 230 235 

Ile Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly
245 250 255

Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu
260 265 270

Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr
275 280 285

Val Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser
290 295 300

Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln
305 310 315 320

Lys Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn
325 330 335

Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys
340 345 350

Val Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val
355 360 365

Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu
370 375 380

Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
385 390 395 400

Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala
405 410 415

Val Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val
420 425 430

Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val
435 440 445

Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro
450 455 460

Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val
465 470 475 480

Thr Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr
485 490 495

Ser Pro Gly Gln Arg Val Glu Phe
500

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(C) INDIVIDUAL ISOLATE: Rodney 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 409-1-1 (c-a) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..477 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:7: 

GAA TTC CGC ACG CCC GCC GAG ??? ACA GTT ??? ??? ??? ??? TAC ATG 48
Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
1 5 10 15

AAC ACT CCC GGG CTT CCC GTC TGC CAG GAC GGA ??? ??? ??? ??? ??? 96
Asn Thr Pro Gly Leu Pro Val Cys Gln Asp Gly Ile Pro Ser Pro Ser
20 25 30

ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA GTA 144
Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
35 40 45

ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG TGC 192
Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
50 55 60

GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG GCC 240
Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
65 70 75 80

TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT GTT 288
Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
85 90 95

GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC 336
Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
100 105 110

GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT TTC 384
Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
115 120 125

AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG GAT 432
Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
130 135 140

GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC ACG GAA TTC 477
Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Thr Glu Phe
145 150 155

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
1 5 10 15

Asn Thr Pro Gly Leu Pro Val Cys Gln Asp Gly Ile Pro Ser Pro Ser
20 25 30

Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
35 40 45

Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
50 55 60

Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
65 70 75 80

Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
85 90 95

Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
100 105 110

Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
115 120 125

Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
130 135 140

Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Thr Glu Phe
145 150 155

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: 409-1-1 (abc) 

(ix) FEATURE:

(A) NAME /KEY: CDS 

(B) LOCATION: 1..558 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TCC ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA 48
Ser Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu
1 5 10 15

GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG 96
Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys
20 25 30

TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG 144
Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val
35 40 45

GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT 192
Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp
50 55 60

GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC 240
Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp
65 70 75 80

TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 288
Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
85 90 95

TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG 336
Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln
100 105 110

GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG 384
Asp Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys
115 120 125

CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC CCC TCC GGC ATG 432
Pro Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met
130 135 140
TTC GAC TCG TCC GTC CTC TGT GAG TGC TAT GAC GCA GGC TGT GCT TGG 480
Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp
145 150 155 160

TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG 528
Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
165 170 175

AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC 558
Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
180 185

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 186 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Ser Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu
1 5 10 15

Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys
20 25 30

Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val
35 40 45

Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp
50 55 60

Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp
65 70 75 80

Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
85 90 95

Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln
100 105 110

Asp Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys
115 120 125

Pro Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met
130 135 140

Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp
145 150 155 160

Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met
165 170 175

Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
180 185

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hepatitis C Virus

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: GG1 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..657 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATG GGC ACG AAT CCT AAA CCT CAA AAA AAA AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC 480
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 528
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 576
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 624
Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 657
Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC

(vii) IMMEDIATE SOURCE:
(B) CLONE: CapA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..657

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC 480
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 528
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 576
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 624
Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 657
Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr
180 185 190

Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro
195 200 205

Asn Ser Ser Ile Val Tyr Glu * * Gly Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC450 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..453 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 432
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

GGA GGC GCT GCC AGG GCC TA 453
Gly Gly Ala Ala Arg Ala
145 150

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 150 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Leu
130 135 140

Gly Gly Ala Ala Arg Ala
145 150



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC360 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..360 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

CGG CGT AGG TCG CGC AAT TTG TA 360
Arg Arg Arg Ser Arg Asn Leu
115 120

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110

Arg Arg Arg Ser Arg Asn Leu
115



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE:

(B) CLONE: C1NC270

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..273

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 192
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240
Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 273
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
85 90



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
85 90

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C100NC270 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..183

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 48
Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 96
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 144
Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 183
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
50 55 60

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
50 55 60



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C100NC360 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..270 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 48
Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val
1 5 10 15

CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 96
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 144
Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG 192
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala
50 55 60

GGA TGG CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 240
Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr
65 70 75 80

GAC CCC CGG CGT AGG TCG CGC AAT TTG TA 270
Asp Pro Arg Arg Arg Ser Arg Asn Leu
85 90



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 
1 5 10 15

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg
20 25 30

Gln Pro Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gln
35 40 45 

Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala 
50 55 60 

Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
65 70 75 80 

Asp Pro Arg Arg Arg Ser Arg Asn Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC105 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..106 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48
Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96
Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

GGA GTT TTA A 106
Gly Val Leu
35



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Gly Thr Asn Pro Lys Pro Gln Lys Lys Asn Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Leu
35
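
The correspondence between a coding sequence in this listing and the peptide listed after it can be checked by translating the codons with the standard genetic code. The following is a minimal sketch and is not part of the original disclosure: the names BASES, AMINO, CODON_TABLE, translate and SEQ_ID_25 are illustrative assumptions, and the input is the 106-base sequence of SEQ ID NO: 25 as printed above. Its first 105 bases (35 codons) should give the 35-residue peptide of SEQ ID NO: 26.

```python
# Illustrative check only; helper names are assumptions, not from the filing.
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon into one-letter amino acid codes.

    Whitespace is removed first; a trailing partial codon is ignored.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# SEQ ID NO: 25 as printed in the listing (clone C1NC105, 106 bases).
SEQ_ID_25 = """
ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC
CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT
GGA GTT TTA A
"""

peptide = translate(SEQ_ID_25)
print(len("".join(SEQ_ID_25.split())), len(peptide), peptide)
```

The one-letter string printed by the sketch can be compared residue by residue with the three-letter listing of SEQ ID NO: 26 (Met Gly Thr ... Gly Val Leu); the final unpaired base is not translated.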
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*21 CGGACGCCACAIAC 
GCCXCCGGTGTRIG, 




481 sssssss~ 




551 ^^^SgJS^SSS 



721 





501 * ccn:ST ** I,cmOTa ^Tcccrarece^ 
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ACAAffl^CGAAACCnAmTAAACrTCr 

1081 ?2S2!^^ 

1141 sssssssss 
1501 sss^g^g^^ 

1351 ss^sgs^ss^s^^ 

1321 ^^^^^^^SS!^^^^ 

1381 Sg^^S^-gg--j~ 



1501 




1551 sg^^ss^sgs^^ 

1621 ^^^^SSgSSSgggffig 



1581 
1741 




1801 

1861 SffigS^S^IgSg^iS{iS 
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1921 TOiCEXC»CGCMG55TCCa!TCAICC^T^ 

ACCCCAGT GSS X IL CCACSACACTACGTCSACATATCG^ 

1981 GCTGGCCCGCXCCGCAAGSTAGCCGCTCAXTCACACCaSCA^ 

CGACCGGGO»GGC£TTreATCGGCGAGTAACTGIGGGAaErGAAC^^ 

2041 TIXACCIGGICACGAGGCACGCCGATGTC^ITCCCGIGCGC^^ 

AAATGCACCftGTGCTCCGTGCGGCTACAGTAAGGGCACGCreCCGCCCCACIAICGTCCC 

2101 GCAG^IGClGItlGCCCC^CCCAXTTCC^^ 

CGTO^ACGACAGeGGGGCCGGGTAAAGGATGAACITTCa^^ 

21G1 TGTGCCCCGCGGGGCACGCanGGG^TWTTAGGGCCGCGGTGTGGVCCaTGG 

ACACGGGGCGCCCCGTGCGGCXCCCGTAlAAATCCaMCGCCkakCGTGGGCACCTG^CC 

2221 CIAAGGCGGIGGACTTTATCCCTGTGGAGAACCTAGAGACAACCAIGAGG^ 
GaTTCCGCCACriGAAATAGgGACACCTCTO 

2281 ICACGGAX^^ 

AGTGCCIATTGAGGAGAGGIGGTCATCACGGGGTCTCGAAGGTCCACQSAGIG^ 

23*1 acocac aGccascOT 

<aGSGTGICCGTCCCCGT3 ^^ 

2401 

TOaaaTCATGAGTIGGGGAGACAA(^CGTTG2GACCCXAAACCAC^^ 

2461 25525^^ 

KCGAG»CCCaGClAGGATTGTAGTCCTGGCCCCACTC=TGTIW 

2521 sssss^^ 

TftCIGimaiTAAACftCIGCTCaaaraCAGGlXXCIAC^^ 
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2701 £5255^ 

GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCICCTCCAACGAGACAGGTGGT 

G^ClCICiaGGGAAAAAlGCCGTTCCGAIAGGGGGAGCITCATTAGTlCCCCCCCTCIG 

2821 ATCTCATClTrMGTCATTCAAAGAAGAACTCCGA&AACrffi 
™»CTAGMGACACaA^ 

2881 tgggc&tcaatgccgtggcctactaccgotgtcttgacgtgtccgtcatcccgaccagcg 
acccctagtiacggcaccggatgatggcgccagaactccacaggS 
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2941 <ra aiUf I UrU. 'I C6 I G G CA&C CGAI<^CCTC^^ 

CGCIACAACACCAGCACCGTIGGClACGGGACTACTCGCCGATATGGCCGCTt^AGCTGA 



3001 CGGTGAIAGACIGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACiaiCIGACGIXATGCACACAGTGGGTCIGICaGCIAAAGTCGGAA 



3061 TOCCAIIGAGACAAICACGCICCCCCAGGATGCIGTCTCCe^^ 
AGTGCT^CTCTGTTAGTa^GGa^rCCTA^ 



3121 QSZCXGGaGGGGGZZGCaGGCHXmai ^ 

CCIGACOTCCaXITCGGTCCG13U»AIGTCI^CACCGT^ 



3181 CC A l W U U»A CCa ; i C rai CC T C ltVlt^Ura 

CGXACAAGCIGAGCAGGCAGGAGACACICACGATACTGCGTCCGACACGAACCATACTCG 



3241 TOVCGCCCGCCGAGACTACAGTTW^CTACGAGCGTACAIGAACVCCCCGGGGCTTCCCG 
AGTGCGGGCGGCICIGATGTCAATCCGAIGCTCGCATGIACTTGTGGGGCCCCGAAGGGC 



3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTZACAGGCCTCACTCATAXAGATG 



3361 CCCA LXTXt. ' i l A TCCCRGaCAAAGCAGAGTGGGGAGAAC Lr.I ' CCl ' lA CCTGGTAGCGTACC 



3421 AAGCCACCGTGlGCGCTAGGGCTa^AGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TT C G G TGGCAC A CGCGAICCCGAGTT CGG GGAGGGGGTAGCACCCIGGTCIACACCTTCA 



3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTAXACAGACTGGGCG 
CAAACTAAGGGGAGTTCGGGTGGGAGG3ACCCGGTTGTGGGGAd3AXAiSTC3GACCCGC 



3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGT
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACA



3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA



ACCgWgCAXAAQK»CAGTIGTCCGAC^^ 



3721 GGAAGCOGGCAATCAXACClt^CAGGGAAGTCCTCTACCGAGAGTTCGATCAGATGGAAG 
CCTTCGGCQniAGTAlGGACIGTCCCTTCAGGAGATGGCTCTCAAGCTACrCTACCTTC 



3781 AGIGCTCTCAGCACTXACCGTACATCGAGCAAGGGATGATGCTCGCCGAGC^ 

TCACGAGAGXCGTGAATGGCATGTAGCTCGXTCCCTACTACGAGCGGCTCGTCAAGTTCG 



3841 AGAAGGCOCICGGC^^^CAGACCGC G T CCG jXX I AGGCAGAGGTTATCGC CC nPG C lXi 

~ ~ ~~ — * * ^^^m^ MMMM* MMMjtaa. — — — 





CCCTCTTGGAAGGAATGGACCATCGCATGG 
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3901 SS2^SS»£^^ 

[Sequence listing, positions 3961 to 4560: bases not recoverable]
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IT IS CLAIMED: 

1. A recombinant polypeptide which is immunoreactive 
with sera from humans infected with hepatitis C virus 
(HCV).

2. The polypeptide of claim 1, which includes an
immunoreactive portion of an HCV polypeptide which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
c) has the carboxy-terminal sequence presented as SEQ
ID NO: 4.

3. The polypeptide of claim 2, which includes the
entire 504 residue peptide.

4. The polypeptide of claim 3, whose carboxy-terminal
amino acid sequence is encoded by the polynucleotide
sequence presented as SEQ ID NO: 3.

5. The polypeptide of claim 3, which is produced by
the expression vector contained in an Escherichia coli host
identified by ATCC No. 40901.

6. The polypeptide of claim 1, which includes an
immunoreactive portion of the peptide sequence presented as
SEQ ID NO: 2.

7. The polypeptide of claim 6, which includes the
specified sequence.

8. The polypeptide of claim 7, which is encoded by
the polynucleotide sequence presented as SEQ ID NO: 1.

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9. The polypeptide of claim 7, which is produced by 
the expression vector contained in an Escherichia coli host 
identified by ATCC No. 40893. 

10. The polypeptide of claim 1, which includes the
peptide sequence presented as SEQ ID NO: 8.

11. The polypeptide of claim 10, which is encoded by
the polynucleotide sequence presented as SEQ ID NO: 7.

12. The polypeptide of claim 1, which includes the
peptide sequence presented as SEQ ID NO: 10.

13. The polypeptide of claim 12, which is encoded by
the polynucleotide sequence presented as SEQ ID NO: 9.

14. The polypeptide of claim 1, which includes a
peptide sequence selected from the group consisting of the
peptide sequences presented as SEQ ID NO: 12, SEQ ID NO: 14,
SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ
ID NO: 24, and SEQ ID NO: 26.

15. The polypeptide of claim 1, which is encoded by
a polynucleotide sequence selected from the group of
polynucleotide sequences presented as SEQ ID NO: 11, SEQ ID
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID
NO: 21, SEQ ID NO: 23, and SEQ ID NO: 25.

16. A diagnostic kit for use in screening human blood
containing antibodies specific against hepatitis C virus
(HCV) infection comprising

at least one peptide antigen which is immunoreactive
with sera from humans infected with hepatitis C virus
(HCV), and

means for detecting the binding of said antibodies to
the antigen.

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17. The kit of claim 16 , wherein the antigen includes 
an immunoreactive portion of an HCV polypeptide sequence
which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
c) has the carboxy-terminal sequence presented as SEQ
ID NO: 4.

18. The kit of claim 16, wherein the antigen includes
an immunoreactive portion of the peptide sequence presented
as SEQ ID NO: 2.

19. The kit of claim 16, wherein the antigen includes
an immunoreactive portion of the peptide sequence presented
as SEQ ID NO: 8.

20. The kit of claim 19, which further contains a
second antigen, wherein the second antigen includes an
immunoreactive portion of a peptide sequence selected from
the group consisting of the peptide sequences presented as
SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ
ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, and SEQ ID NO: 26.

21. The kit of claim 16, wherein the antigen includes
an immunoreactive portion of the peptide sequence presented
as SEQ ID NO: 10.

22. The kit of claim 16, wherein the antigen includes
an immunoreactive portion of a peptide sequence selected
from the group consisting of the peptide sequences presented
as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID
NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, and SEQ ID
NO: 26.

23. The kit of claim 16, wherein said detecting means
includes a solid support to which said peptide is attached
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and a reporter-labeled anti-human antibody, wherein binding 
of said serum antibodies to said antigen can be detected by 
binding of the reporter-labeled antibody to said solid 
surface. 

24. A method of detecting hepatitis C virus (HCV)
infection in an individual comprising

reacting serum from an HCV-infected test individual
with at least one peptide antigen which is immunoreactive
with sera from humans infected with hepatitis C virus
(HCV), and

examining the antigen for the presence of bound
antibody.

25. The method of claim 24, wherein the peptide
antigen includes an immunoreactive portion of an HCV
peptide which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
c) has the carboxy-terminal sequence presented as SEQ
ID NO: 4.

26. The method of claim 24, wherein the peptide
antigen includes an immunoreactive portion of the sequence
presented as SEQ ID NO: 2.

27. The method of claim 24, wherein the peptide
antigen includes an immunoreactive portion of the sequence
presented as SEQ ID NO: 8.

28. The method of claim 24, which further contains a
second antigen, wherein the second antigen includes an
immunoreactive portion of a peptide sequence selected from
the group consisting of the peptide sequences presented as
SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ
ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, and SEQ ID NO: 26.

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29. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 10. 

30. The method of claim 24, wherein the antigen
includes an immunoreactive portion of a peptide sequence
selected from the group consisting of the peptide sequences
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, and SEQ
ID NO: 26.

31. The method of claim 24, wherein the peptide
antigen is attached to a solid support, said reacting
includes reacting the peptide antigen with the support, and
subsequently reacting the support with a reporter-labeled
anti-human antibody, and said examining includes detecting
the presence of reporter-labeled antibody on the solid
support.

32. A method of producing a polypeptide which is
immunoreactive with sera from humans infected with hepatitis
C virus (HCV), comprising

introducing into a suitable host, a recombinant
expression system containing an open reading frame (ORF)
having a polynucleotide sequence which encodes a polypeptide
which is immunoreactive with sera from humans infected
with hepatitis C virus (HCV), where the vector is designed
to express the ORF in said host, and

culturing said host under conditions resulting in the
expression of the ORF sequence.

33. The method of claim 32, wherein the polypeptide
includes an immunoreactive portion of an HCV polypeptide
which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
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c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4; and 

where the carboxy-terminal amino acid sequence of said
peptide antigen is encoded by the polynucleotide sequence
presented as SEQ ID NO: 3.

34. The method of claim 33, wherein the expression
vector is a lambda gt11 phage vector, the host is E. coli,
and the host containing the introduced vector is identified
by ATCC No. 40901.

35. The method of claim 32, wherein the polypeptide
has the sequence presented as SEQ ID NO: 2, and the
polynucleotide has the sequence presented as SEQ ID NO: 1.

36. The method of claim 35, wherein the expression
vector is a lambda gt11 phage vector, the host is E. coli,
and the host containing the introduced vector is identified
by ATCC No. 40893.

37. The method of claim 32, wherein the polypeptide
has the sequence presented as SEQ ID NO: 8, and the
polynucleotide has the sequence presented as SEQ ID NO: 7.

38. The method of claim 37, wherein the expression
vector is a lambda gt11 phage vector, the host is E. coli,
and the host containing the introduced vector is identified
by ATCC No. 40792.

39. The method of claim 32, wherein the polypeptide
has the sequence presented as SEQ ID NO: 10, and the
polynucleotide has the sequence presented as SEQ ID NO: 9.

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40. The method of claim 39, wherein the expression 
vector is a lambda gt11 phage vector, the host is E. coli,
and the host containing the introduced vector is identified
by ATCC No. 40876.

41. The method of claim 32, wherein the polypeptide
includes an immunoreactive portion of a peptide sequence
selected from the group consisting of the peptide sequences
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ
ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, and SEQ
ID NO: 26.

42. The method of claim 41, wherein the expression
vector is a pGEX or pET vector and the host is E. coli.

43. An expression system for expressing a recombinant
peptide antigen which is immunoreactive with sera from
humans infected with hepatitis C virus (HCV), comprising

a host capable of supporting expression of an open
reading frame in a selected expression vector, and

the selected expression vector containing an open
reading frame (ORF) having a polynucleotide sequence which
encodes said peptide antigen.

44. The expression system of claim 43, wherein the
peptide antigen includes an immunoreactive portion of an
HCV polypeptide sequence which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
c) has the carboxy-terminal sequence presented as SEQ
ID NO: 4; and

where the carboxy-terminal amino acid sequence of said
peptide antigen is encoded by the polynucleotide sequence
presented as SEQ ID NO: 3.

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45. The expression system of claim 44, wherein the 
expression vector is a lambda gt11 phage vector, the host
is E. coli, and the host containing the introduced vector
is identified by ATCC No. 40901.

46. The expression system of claim 43, wherein the
peptide antigen has the sequence presented as SEQ ID NO: 2,
and the polynucleotide has the sequence presented as SEQ ID
NO: 1.

47. The expression system of claim 46, wherein the
expression vector is a lambda gt11 phage vector, the host
is E. coli, and the host containing the introduced vector
is identified by ATCC No. 40893.

48. The expression system of claim 43, wherein the peptide
antigen has the sequence presented as SEQ
ID NO: 8, and the polynucleotide has the sequence presented
as SEQ ID NO: 7.

49. The expression system of claim 48, wherein the
expression vector is a lambda gt11 phage vector, the host
is E. coli, and the host containing the introduced vector
is identified by ATCC No. 40792.

50. The expression system of claim 43, wherein the peptide
antigen has the sequence presented as SEQ
ID NO: 10, and the polynucleotide has the sequence presented
as SEQ ID NO: 9.

51. The expression system of claim 50, wherein the
expression vector is a lambda gt11 phage vector, the host
is E. coli, and the host containing the introduced vector
is identified by ATCC No. 40876.

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52. The expression system of claim 43, wherein the 
polypeptide includes an immunoreactive portion of a peptide 
sequence selected from the group consisting of the peptide 
sequences presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID
NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID
NO: 24, and SEQ ID NO: 26.

53. The expression system of claim 52, wherein the
expression vector is a pGEX or pET vector and the host is
E. coli.

54. A polynucleotide which encodes a polypeptide
which is immunoreactive with sera from humans infected with
hepatitis C virus (HCV).

55. The polynucleotide of claim 54, wherein the
polypeptide includes an immunoreactive portion of a peptide
sequence which:
a) is encoded by an HCV coding sequence;
b) has 504 amino acid residues; and
c) has the carboxy-terminal sequence presented as SEQ
ID NO: 4; and

where the carboxy-terminal amino acid sequence of said
peptide antigen is encoded by the polynucleotide sequence
presented as SEQ ID NO: 3.

56. The polynucleotide of claim 54, which includes
the polynucleotide sequence presented as SEQ ID NO: 1.

57. The polynucleotide of claim 54, which includes
the polynucleotide sequence presented as SEQ ID NO: 9.

58. The polynucleotide of claim 54, which includes a
polynucleotide sequence selected from the group of
polynucleotide sequences presented as SEQ ID NO: 11, SEQ ID NO: 13,
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SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ 
ID NO:23, and SEQ ID NO:25. 
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GAA TTC CTC GTG CAA GCG TGG AAG TCC AAG AAA ACC CCA ATG GGG TTC 
TCG TAT GAT ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC GAC ATC 
CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC 
CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT 
CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG 
AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG 
CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC 
CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 
TAC TCC GCC CCC CCC GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG 
CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT 
GGA AAG AGG GTC TAC TAC CTC ACC CGG 
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■ STTGEI PFYGKAIPLE 
CC ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA 

VIKGGRHLIFCHSKKK
GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG

CDELAAKLVALGINAV
TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG

AYYRGLDVSVIPTSGD
GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT

VVVVATDALMTGYTGD
GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC

FDSVIDCNTCVTQTVD
TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT

FSLDPTFTIETITLPQ
TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG

DAVSRTQRRGRTGRGK
GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG

PGIYRFVAPGERPSGM
CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC CCC TCC GGC ATG

FDSSVLCECYDAGCAW
TTC GAC TCG TCC GTC CTC TGT GAG TGC TAT GAC GCA GGC TGT GCT TGG

YELTPAETTVRLRAYM
TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG

NTPGLPVCQD*
AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC

Fig. 5
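The pairing of each codon row with the one-letter amino acid row above it can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and assuming the reading frame starts at the first base of the string passed in; the function name `translate` and the table layout are illustrative choices, not part of the specification.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA string (whitespace ignored) in frame 1.

    Trailing bases that do not fill a whole codon are dropped.
    """
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Two codon rows copied from the listing above.
row = "GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG"
last = "AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC"
print(translate(row))
print(translate(last))
```

A row that begins with a partial codon (for example the leading "CC" of the first row) would have to be trimmed by the caller before translation, since the sketch does not search for a reading frame.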
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Epitope Comparison/Delineation 

Sequence coordinates marked on the map: 225, 2754, 3024, 3129, 3311

Clone 33C (799bp)
Clone 33C-U (529bp)
409.1.1 (557bp)
409.1.1 F1R2 (270bp)
409.1.1 (a) (375bp)
409.1.1 (ca) (445bp)
409.1.1 (u) (105bp)
409.1.1 (C) (70bp)
409.1.1 (c+270) (340bp)

Fig. 7
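The fragment lengths printed in Fig. 7 can be cross-checked against the coordinates marked on the map. The sketch below is only an arithmetic check: the lengths are taken from the figure, but the assignment of start and end coordinates to particular fragments is an assumption inferred from the fact that the differences match the printed lengths, and the convention length = end - start is likewise assumed.

```python
# Lengths (bp) printed next to each fragment in Fig. 7.
PRINTED_BP = {
    "409.1.1": 557,
    "409.1.1 F1R2": 270,
    "409.1.1 (a)": 375,
    "409.1.1 (C)": 70,
    "409.1.1 (ca)": 445,
    "409.1.1 (c+270)": 340,
}

# ASSUMED endpoints: the figure lists these coordinates, but which
# fragment each one bounds is inferred here, not stated in the text.
ASSUMED_SPAN = {
    "409.1.1": (2754, 3311),
    "409.1.1 F1R2": (2754, 3024),
    "409.1.1 (a)": (2754, 3129),
}


def span_length(start: int, end: int) -> int:
    """Length of a fragment under the assumed end - start convention."""
    return end - start


def spans_agree() -> dict:
    """Map each fragment with assumed endpoints to True if the
    coordinate difference equals the printed length."""
    return {
        name: span_length(*ASSUMED_SPAN[name]) == PRINTED_BP[name]
        for name in ASSUMED_SPAN
    }


def composites_agree() -> bool:
    """Check that the (ca) and (c+270) lengths equal the sum of their parts."""
    c = PRINTED_BP["409.1.1 (C)"]
    return (
        c + PRINTED_BP["409.1.1 (a)"] == PRINTED_BP["409.1.1 (ca)"]
        and c + PRINTED_BP["409.1.1 F1R2"] == PRINTED_BP["409.1.1 (c+270)"]
    )


print(spans_agree())
print(composites_agree())
```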
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6 6 6 

  1  CC ATG GGC ACG ATT CCC AAA CCT CAA AAA AAA AAC AAA CGT
        M   G   T   I   P   K   P   Q   K   K   N   K   R
 43  AAC ACC AAC CGT CGC CCA CGG GAC GTC AAG TTC CCG GGT GGC
     N   T   N   R   R   P   R   D   V   K   F   P   G   G
 85  GGT CAG ATC GTT GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC
     G   Q   I   V   G   G   V   Y   L   L   P   R   R   G
127  CCT AGA TTG GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG
     P   R   L   G   V   R   A   T   R   K   T   S   E   R
169  TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC AAG GCT CGT
     S   Q   P   R   G   R   R   Q   P   I   P   K   A   R
211  CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG TAC CCT TGG
     R   P   E   G   R   T   W   A   Q   P   G   Y   P   W
253  CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG CTC
     P   L   Y   G   N   E   G   C   G   W   A   G   W   L
295  CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC
     L   S   P   R   G   S   R   P   S   W   G   P   T   D
337  CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC
     P   R   R   R   S   R   N   L   G   K   V   I   D   T
379  CTT ACG TGC GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC
     L   T   C   G   F   A   D   L   M   G   Y   I   P   L
421  GTC GGC GCC CCT CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT
     V   G   A   P   L   G   G   A   A   R   A   L   A   H
463  GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA ACA GGG
     G   V   R   V   L   E   D   G   V   N   Y   A   T   G
505  AAC CTT CCT GGT TGC TCT TTC TCT ATC TTC CTT CTG GCC CTG
     N   L   P   G   C   S   F   S   I   F   L   L   A   L
547  CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC CAA GTG CGC
     L   S   C   L   T   V   P   A   S   A   Y   Q   V   R
589  AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC
     N   S   T   G   L   Y   H   V   T   N   D   C   P   N
631  TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC  659
     S   S   I   V   Y   E   Z   Z   G   S

Fig. 8A



WO 91/15516 



PCT/US91/02370 



10/12 



10 20 30 



40 50 



60 



i i \ 

AAA 
(wild Type) 

Primer CI 
METGlyrhrAsnPro^^^ 

120 



70 80 90 100 HQ 



GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG 

qaaeeea^ rracrtCBar;'PTTACTTG'PTGCg -.s. 
Primer CI 00 

*~ TCAGATPffTTGGTggaflTO taataggga^^^ 
primer NC105 ™" 

AspValLysPheProGlyGlyGlyGlnIleValGlyGlyValTyrLeuLeuProArgArg



130 140 150 i 6 o i 7 o 



180 



GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly

240 



190 200 210 220 230 



AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly



250 260 270 280 



290 300 



TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC

ocn^m.™- Primer C270-> 

<— CTCTATGGCAATfiA<sggr»n» " aa a i- P , r ,„„ r . f , 
m. « m < -Primer ^ 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro



Fig. 8B 
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310 320 330 340 350 360 

I I I I ! 

CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 

cracrcccaTGGGT 
(C360->) 

< — gGCGTAGGTCGCGCAATTTGtaa 
qqatccacrcc <-»air K360 . 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsnLeuGly

370 3 80 390 400 410 420 

i 

AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC
Primer C360-> 

LysValIleAspThrLeuThrCysGlyPheAlaAspLeuMETGlyTyrIleProLeuVal
430 440 450 460 470 480 



GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
< — GGAGGCGCTGCPAgflRCCtaaggafeflntTrron 
<-- Prxmer NC450 — 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 
490 500 510 520 530 540 

'ill!! 

GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTCCTTCTGGCC 
< — GCAACAGGGAACCTTCgTGGTTaaaaatccaacc 
< — Primer NC520 

GlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerIlePheLeuLeuAla
550 560 570 580 590 600 

! ! ! ! 

CTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTGCGCAACTCCACGGGGCTT
<-z£2£IGCCCGCTTCGGCCTAagaatccgaTC 
< — Primer NC580 — 

LeuLeuSerCysLeuThrValProAlaSerAlaTyrGlnValArgAsnSerThrGlyLeu 

610 620 630 640 650 

TACCACGTCACCAATGATTGCCCTAACTCGAGCATTGTGTACGAGTAATAGGGATCC

< — GTACGAGTA ATAGGGATCPcra a 
< — Primer NC660 — 
(3 'primer) 

TyrHisValThrAsnAspCysProAsnSerSerIleValTyrGlu      GlySer

Fig. 8B (con't) 
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80 120 |60 

AMINO ACID RESIDUES (horizontal axis; ticks 80, 120, 160, 200; 3' end at primer NC660)

Constructs shown:

C1NC450
C1NC360
C1NC270
C1NC105

C100NC270
C100NC360
C270NC360
C270NC450

Fig. 9
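The construct names listed for Fig. 9 appear to combine a forward primer (C...) and a reverse primer (NC...) from Fig. 8B. The sketch below parses such a name and estimates the span it covers. It rests on two assumptions that the figures suggest but the text here does not state: that the number in each primer name is an approximate nucleotide position in the Fig. 8B numbering, and that the span is counted inclusively. The function names are illustrative.

```python
import re


def parse_construct(name: str) -> tuple:
    """Split a name such as 'C100NC270' into (forward, reverse) positions."""
    m = re.fullmatch(r"C(\d+)NC(\d+)", name)
    if m is None:
        raise ValueError(f"unrecognised construct name: {name!r}")
    return int(m.group(1)), int(m.group(2))


def approx_span(name: str) -> dict:
    """Estimate the nucleotide span and codon count of a construct.

    ASSUMPTION: primer numbers are approximate nucleotide positions,
    so the result is a rough guide, not a measured fragment size.
    """
    start, end = parse_construct(name)
    length_nt = end - start + 1  # inclusive count, assumed convention
    return {
        "start_nt": start,
        "end_nt": end,
        "length_nt": length_nt,
        "approx_residues": length_nt // 3,
    }


for construct in ("C1NC450", "C1NC105", "C100NC270", "C270NC450"):
    print(construct, approx_span(construct))
```

Under these assumptions a name such as C1NC450 would correspond to roughly the first 150 codons of the Fig. 8A reading frame, which is consistent with the amino acid residue axis of Fig. 9.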
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